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Abstract 



The quantum dynamics and grand canonical thermodynamics of many-mode (one-, 
two-, and three-dimensional) interacting Bosc gases are simulated from first princi- 
ples. The model uses a lattice Hamiltonian based on a continuum second-quantized 
model with two-particle interactions, external potential, and interactions with an 
environment, with no further approximations. The interparticle potential can be 
either an (effective) delta function as in Bose-Hubbard models, or extended with a 
shape resolved by the lattice. 

Simulations are of a set of stochastic equations that in the limit of many realiza- 
tions correspond exactly to the full quantum evolution of the many-body systems. 
These equations describe the evolution of samples of the gauge P distribution of the 
quantum state, details of which are developed. 

Conditions under which general quantum phase-space representations can be 
used to derive stochastic simulation methods are investigated in detail, given the 
criteria: 1) The simulation corresponds exactly to quantum mechanics in the limit 
of many trajectories. 2) The number of equations scales linearly with system size, to 
allow the possibility of efficient first-principles quantum mesoscopic simulations. 3) 
All observables can be calculated from one simulation. 4) Each stochastic reahzation 
is independent to allow straightforward use of parallel algorithms. Special emphasis 
is placed on allowing for simulation of open systems. In contrast to typical Monte 
Carlo techniques based on path integrals, the phase-space representation approach 
can also be used for dynamical calculations. 

Two major (and related) known technical stumbling blocks with such stochastic 
simulations are instabilities in the stochastic equations, and pathological trajectory 
distributions as the boundaries of phase space are approached. These can (and 



often do) lead to systematic biases in the calculated observables. The nature of 
these problems are investigated in detail. 

Many phase-space distributions have, however, more phase-space freedoms than 
the minimum required for exact correspondence to quantum mechanics, and these 
freedoms can in many cases be exploited to overcome the instability and boundary 
term problems, recovering an unbiased simulation. The stochastic gauge technique, 
which achieves this in a systematic way, is derived and heuristic guidelines for its 
use are developed. 

The gauge P representation is an extension of the positive P distribution, which 
uses coherent basis states, but allows a variety of useful stochastic gauges that are 

used to overcome the stability problems. Its properties are investigated, and the 
resulting equations to be simulated for the open interacting Bose gas system are 
derived. 

The dynamics of the following many-mode systems are simulated as examples: 
1) Uniform one-dimensional and two-dimensional Bose gases after the rapid appear- 
ance of significant two-body collisions (e.g. after entering a Feshbach resonance). 2) 
Trapped bosons, where the size of the trap is of the same order as the range of the 
interparticle potential. 3) Stimulated Bose enhancement of scattered atom modes 
during the collision of two Bose-Einstein condensates. The grand canonical thermo- 
dynamics of uniform one-dimensional Bose gases is also calculated for a variety of 
temperatures and collision strengths. Observables calculated include first to third 
order spatial correlation functions (including at finite interparticle separation) and 
momentum distributions. The predicted phenomena are discussed. 

Improvements over the positive P distribution and other methods are discussed, 
and simulation times are analyzed for Bose-Hubbard lattice models from a general 
perspective. To understand the behavior of the equations, and subsequently optimize 
the gauges for the interacting Bose gas, single- and coupled two-mode dynamical 
and thermodynamical models of interacting Bose gases are investigated in detail. 
Directions in which future progress can be expected are considered. 

Lastly, safeguards are necessary to avoid biased averages when exponentials of 
Gaussian-like trajectory distributions are used (as here), and these are investigated. 
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Thesis Rationale and Structure 

Rationale 

It is a common view that first-principles quantum simulations of mesoscopic dy- 
namics are intractable because of the complexity and astronomical size of the rele- 
vant Hilbert space. The following quotes illustrate the significance of the problem: 
include HIE]: 

"Can a quantum system he probabilistically simulated by a classical uni- 
versal computer? . . . the answer is certainly, No!" (Richard P. Feynman, 
1982). 

"One is forced to either simulate very small systems (i.e. less than 
five particles) or to make serious approximations" (David M. Ceper- 
ley, 1999). 

This is certainly true if one wishes to follow all the intricate details of a wavefunc- 
tion that completely specifies the state of the system. Hilbert space size grows 
exponentially as more subsystems (e.g. particles or modes) are added, and meth- 
ods that calculate state vectors or density matrix elements bog down very quickly. 
Path integral Monte Carlo methods also fail because of the well-known destructive 
interference between paths that occurs when one attempts dynamics calculations. 

Such a situation appears very unfortunate because for many complex physical 
systems a reliable simulation method is often the only way to obtain accurate quan- 
titative predictions or perhaps even a well-grounded understanding. This is par- 
ticularly so in situations where several length/time/energy scales or processes are 
of comparable size/strength, or when non-equilibrium phenomena are important. 
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The need for reliable quantum dynamics simulations can be expected to become 
ever more urgent as more mesoscopic systems displaying quantum behavior are ac- 
cessed experimentally. A pioneering system in this respect are the Bose-Einstein 
condensates of alkali- atom gases realized in recent years El El EI- 

There is, however, a very promising simulation method using phase-space repre- 
sentations that works around this complexity problem. In brief, a correspondence 
is made between the full quantum state and a distribution over separable operator 
kernels, each of which can be specified by a number of variables linear in the num- 
ber of subsystems (e.g. modes). If one then samples the operator kernels according 
to their distribution, then as the number of samples grows, observable averages of 
these operators approach the exact quantum values, i.e. "emerge from the noise". 
In principle one could reach arbitrary accuracy, but in practice computer power of- 
ten severely limits the number of samples. Nevertheless, if one concentrates only on 
bulk properties, and is prepared to sacrifice precision beyond (typically) two to four 
significant digits, many first-principles quantum mesoscopic dynamics results can be 
obtained. The mesoscopic region can be reached because simulation time scales only 
log-linearly^ (iVvar log A^var) with variable number A^ar, and so still scales log-linearly 
with system size. Some initial examples of calculations with this method, particu- 
larly with the positive P representation jTUl ^] based on a separable coherent state 
basis, are many-mode quantum optics calculations in Kerr dispersive media [T^ ITH] 
evaporative cooling of Bose gases with repulsive delta-function interactions [Hj IT5]. 
and breathing of a trapped one-dimensional Bose gaspp. 

In summary, first-principles mesoscopic quantum dynamics is hard, but some 
progress can (and has) been made in recent years. In broad terms, the aim of the 
work reported in this thesis is to advance the phase-space simulation methods some 
more . The investigation here is carried out in several directions: 



^This is because discrete Fourier transforms, usually required for kinetic energy evaluation, 
can be calculated on timescales proportional to iVvar log -/Vvar ■ For some particularly demanding 
models, the scaling may be log-polynomial in A^var due to an increased number of terms in the 
equation for each variable if there is complicated coupling between all subsystem pairs, triplets, 
etc. In any case, simulation time never scales exponentially, as it would for brute force methods 
based on density matrix or state vector elements. 
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1. Stochastic gauges Many phase-space distributions have more phase-space 
degrees of freedom than the minimum required for exact correspondence to 
quantum mechanics. Each such degree of freedom leads to possible modi- 
fications of the stochastic equations of motion by the insertion of arbitrary 
functions or gauges in an appropriate way. While the choice of these gauge 
functions does not influence the correspondence to quantum mechanics in the 
limit of infinitely many samples, it can have an enormous effect on the efficiency 
and/or statistical bias of a simulation with a finite number of samples. (Hence 
the use of the word "gauge" , in analogy with the way electromagnetic gauges 
do not change the physical observables but can have an important effect on the 
ease with which a calculation proceeds). In particular, non-standard choices 
of these gauge functions can lead to enormous improvements in simulation 
efficiency, or can be essential to allow any unbiased simulation at all. 

Here, a systematic way to include these freedoms is derived and ways of making 
an advantageous gauge choice are considered in some detail. As a corollary, 
some results present in the literature [21 d El are found to be examples of 
non-standard gauge choices. 

The gauge P representation, which is a generalization of the successful positive 
P representation based on a coherent state basis to allow a variety of useful 
gauges, is explained. Application of it to interacting Bose gases is developed. 

2. Removal of systematic biases using stochastic gauges. Two major (and 
related) stumbling blocks for phase-space distribution methods have been in- 
stabilities in the stochastic equations, and pathological trajectory distributions 
as the boundaries of phase space are approached. These occur for nonlinear 
systems and can (in fact, apart from special cases, do) lead to overwhelming 
noise or systematic biases in the calculated observables, preventing dependable 
simulations. To date this has been "problem number one" for these methods. 

In this thesis it is shown how appropriate stochastic gauges can be used to over- 
come the instability and boundary term problems in a wide range of models, 
recovering an unbiased simulation, and opening the way for reliable simula- 



tions. Heuristic ways of achieving this in general cases are considered in detail, 
and examples are given for known cases in the literature. 

3. Improvement of efficiency using stochastic gauges The stochastic gauge 
method also has the potential to significantly improve the efficiency of simu- 
lations when appropriate gauges are chosen. In particular, the time for which 
useful precision is obtained can be extended in many cases by retarding the 
growth of noise. This is investigated for the case of gauge P simulations of 
interacting Bose gases and some useful gauges obtained. The regimes in which 
improvements can be seen with the gauges developed here are characterized. 
Heuristic guidelines for gauge choice in more general representations and mod- 
els are also given. Example simulations are made. 

4. General requirements for usable distributions in open systems An- 
other aim here is to determine what are the bare necessities for a phase-space 
distribution approach to be successful, so that other details of the representa- 
tion used (e.g. choice of basis) can be tailored to the model in question. This 
can be essential to get any meaningful results, as first-principles mesoscopic 
simulations are often near the limit of what can be tractably calculated. To 
this end, general features of the correspondence between quantum mechanics 
and stochastic equations for the variables specifying the operator kernels are 
considered in some detail. 

Particular concern is given to simulating open systems, as most experimentally 
realizable systems do not exist in isolation and will have significant thermal and 
also particle-exchange interactions with external environments. These must 
usually be taken into account for an accurate description. (The alternative is 
to make wholesale approximations to the model whose precise effect is often 
difficult to ascertain). 

5. Application to thermodynamic calculations A separate issue are static 
calculations of thermodynamic equilibrium ensembles. These have tradition- 
ally been the domain of quantum Monte Carlo methods, the most versatile 
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of which have been those based on the path integral approach. While path 
integral methods are generally not useful for dynamical calculations because 
destructive interference between the paths occurs extremely rapidly, masking 
any dynamics, good results can be obtained for static thermodynamic ensem- 
ble calculations. If the model is kind, even O (10^) particles or modes can be 
successfully simulated, if one again concentrates on bulk properties and only 
several significant digits. 

Phase-space representations that include dynamically changing trajectory weights 
can also be used for thermodynamics calculations. The thermal density matrix 
is evolved with respect to inverse temperature after starting with the known 
high temperature state. (Such simulations are sometimes said to be "in imag- 
inary time" because of a similarity between the resulting equations and the 
Schrodinger equation after multiplying time by i). This approach appears 
competitive with path integral Monte Carlo methods in terms of efficiency, 
but offers two distinct advantages: Firstly, all observables can in principle be 
calculated in a single simulation run, which is not the case in path integral 
methods. These latter require separate algorithms for e.g. observables in posi- 
tion space, momentum space, or observables not diagonal in either. Secondly, 
a single simulation run gives results for a range of temperatures, while in path 
integral methods a new simulation is needed for each temperature value. 

6. Demonstration with non-trivial examples Lastly, but perhaps most im- 
portantly, one wishes to demonstrate that the results obtained actually are 
useful in non-trivial cases. Simulations of dynamics and thermodynamics in 
a variety of mesoscopic interacting systems are carried out, and their phys- 
ical implications considered. The emphasis will be foremost on simulations 
of interacting Bose gases. It is chosen to concentrate on these because they 
are arguably the systems where quantum effects due to collective motion of 
atoms are most clearly seen and most commonly investigated by contemporary 
experiments. This refers, of course, to the celebrated Bose-Einstein conden- 
sate experiments on cold, trapped, rarefied, alkali-metal gases. (For a recent 
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overview see, for example, the collection of articles in Nature Insight on ul- 
tracold matter fn)\ ITTf IT^ IT?H 1^.) As these gases are an extremely dynamic 
field of research the need for first-principle simulation methods appears urgent 
here. 

The "holy grail" of quantum simulations, which would be a universally-applicable 
black-box tractable simulation method^, probably does not exist. However, sim- 
ulations are of such fundamental importance to reliable predictions in mesoscopic 
physics that any significant progress has the potential to be a catalyst for far-ranging 
discoveries (and in the past often has). 

Structure 

This thesis begins with two introductory chapters, which explain in more detail the 
background issues. That is. Chapter ^discusses the fundamentals of why many-body 
quantum simulations are difficult, compares several approaches, and motivates the 
choice to work on the mode-based phase-space representation simulation methods 
considered in this thesis. Chapter |21 summarizes the interacting Bose gas model 
that will be considered in all the simulation examples, and discusses under what 
circumstances a first-principles rather than a semiclassical calculation is needed to 
arrive at reliable predictions. 

The body of the thesis is then divided into three parts of a rather different nature. 
Part A investigates what general properties of a phase-space representation are 
needed for a successful simulation, and explains the gauge P representation, which 
will be used in later parts. Part B calibrates this method on some toy problems 
relevant to the interacting Bose g clS C3iSG, while Part C applies them to non-trivial 
mesoscopic systems (dynamics and thermodynamics of interacting Bose gases). 

Accordingly, Chapter IHl presents a generalized formalism for phase-space repre- 
sentations of mixed quantum states and characterizes the necessary properties for 
an exact correspondence between quantum mechanics and the stochastic equations. 

^In Newtonian dynamics this is just the usual "start with initial conditions and integrate the 
differential equations" . 
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The stochastic gauge technique, which forms the basis of the rest of the devel- 
opments in this thesis, is developed in Chapter 01 In Chapter properties of the 
gauge P representation are investigated, and its apphcation to interacting Bose gases 
developed. In Chapter IHl it is explained how stochastic gauges can be used to over- 
come "technical difficulty number one" : the systematic "boundary term" biases that 
otherwise prevent or hinder many attempted phase-space distribution simulations. 
Some relevant technical issues regarding stochastic simulations have been relegated 
to Appendices. 

One- and two-mode toy models are used in Part B to check correctness of the 
method, and to optimize the gauge functions in preparation for the target aim 
of simulating multi-mode systems. These models are the single-mode interacting 
Bose gas dynamics in Chapter [3 the dynamics of two such Rabi-coupled modes in 
Chapter IHl and grand canonical single-mode thermodynamics in Chapter IHl 

Finally, Chapters ^1 and ^2 give examples of nontrivial many-mode simulations 
of many-mode interacting Bose gas dynamics and grand canonical thermodynamics, 
respectively. 

Chapter summarizes and concludes the work along with some speculation on 
fruitful directions of future research. 
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Chapter 1 



Introduction: 

Why are quantum many-body 
simulations difficult and what can 
be done about it? 



Here it is considered how stochastic methods can allow first-principles quantum 
simulations despite the seemingly intractably large Hilbert space required for such a 
task. The main approaches: path integral Monte Carlo, quantum trajectories, and 
phase-space distributions are briefly compared (without going into mathematical 
details of the methods themselves). This thesis develops and applies the stochastic 
gauge technique to make improvements to the phase-space distribution methods, and 
the choice to concentrate on these methods is motivated by their greater versatility. 
The emphasis throughout this thesis on methods applicable to open systems (leading 
to mostly mode-based formulations) is also discussed. 
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1.1 Hilbert space complexity 

In classical mechanics, an exhaustive simulation of an interacting A^-particle system 
is (in principle) computable^ on a classical computer. For the trivial case of a 
non-interacting three-dimensional (3D) gas in a potential, one would start from 
initial conditions in 6A^ real variables^, and then generically simulate by solving 6N 
differential equations. For particles with ranged binary interactions^ there are 6N 
real differential equations, each with O {N) terms, so simulation times scale as N'^ 
— grudgingly computable. 

Let us now look at the quantum case. Consider a generic mesoscopic system, 
divided up (arbitrarily) into N subsystems. Most typically these subsystems would 
be N particles, spatial or momentum modes, or N orbitals. If we allow each mode 
or orbital to be occupied by up to Pmax particles (Pmax = 1 for fermions, and more for 
bosons), then the full state vector of a pure system in such a model has (Pmax + 1)^ 
complex number components. In the other popular formulation with a set number 
of particles N, one would discretize D-dimensional space into Miatt lattice points 
in each dimension, leading to Mj^^.^ lattice points in all. The state vector of such 
a system would then have complex number components. Both of these cases 

are exponential in the system size A'", and the amount of memory needed to hold 
the state (let alone the time needed to evolve the corresponding number of coupled 
differential equations) becomes intractable very rapidly. This is the Hilbert space 
complexity problem, and is generic to all mesoscopic quantum calculations. 

For example, if one had 8000 Gigabytes of memory just to hold the state vector^, 
one could have 40 fermion modes, or 12 boson modes with each one occupied by at 
most 9 atoms. In the set particle number formulation, one could get away with 4 
particles in a (very sparse!) 10 x 10 x 10 three-dimensional lattice, or 12 particles 
in such a one-dimensional lattice. This indicates where the estimate of about "five 

^ "Computable" here having the usual meaning that simulation time scales linearly, or (grudg- 
ingly) polynomially with A''. Naturally if A'' is truly macroscopic such a system still cannot be 
simulated in practice, but in such large c;lassical systems the primary hindrance to accurate pre- 
dictions is usually deterministic chaos rather than simulation speed. 

^Position and momentum in each degree of freedom. 

^As in the majority of first-principles classical models. 

''Assuming 4 byte floating point data. 
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particles" quoted in the Thesis Rationale on page H came from. 

For open systems, one needs even more data space to hold the state, as the den- 
sity matrix p will have approximately half the square of the number of components 
in a state vector — exponential in system size again, but with twice the exponent. 
The general conclusion is that for mesoscopic (and even many microscopic) systems 
a brute force approach is doomed to failure, except for perhaps a few special cases 
that are separable^. 

1.2 Bulk properties are the key 

The exponential growth of Hilbert space mentioned above appears prohibitive, but 
let us first stand back from the quantum problem for a moment. In classical me- 
chanics there are enormous amounts of variables to keep track of once a system has 
macroscopic numbers of particles — Avogadro's number is very big! The reason 
that statistical mechanics is so successful at dealing with this issue is that for a 
large system, one is only really interested in bulk properties (temperature, pressure, 
two-particle correlations, . . . ). The exact position or momentum of particle number 
9501292851748 is not of interest to anyone. Even more importantly, knowledge of 
such a "raw" observable calculated from a model has no physical predictive power, 
because in such a large system it is not feasible to measure the initial conditions 
exhaustively. Only a statistical description (based on bulk properties) can be input 
as initial conditions into a model and because of this, only bulk statistical properties 
of the model will correspond accurately to the behavior of the physical system under 
study. 

In quantum systems, we have (in a general sense) an analogous problem, but it is 
much more acute because it turns up when we have O (5) subsystems, not O (10^). 
This is simply because in quantum mechanics there is much more to keep track of. 
The full classical description contains the positions and momenta of all the particles 
(e.g. system variables record things like "the position and momentum of particle 
753024"), whereas a full quantum description contains the amplitude and phase of 
^e.g. an ideal gas 
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each one of all the possible configurations between the entire set of particles^, (e.g. 
a quantum mechanics variable would be expressing something like "the probability 
and phase (relative to a general reference) of the particular configuration of particles 
for which: number 1 is at Xi, . . . , number 753024 is at 0:753024, • • • , and number N 
is at xat"). In classical mechanics the entire description is just one of these possible 
correlations. 

Conceptually, what lets one vanquish the enormous numbers of variables is sim- 
ilar in both cases: We're only interested in hulk properties, be they themselves 
classical or quantum in nature^. 

1.3 Sampling system configurations 

There are two kinds of conceptually distinct kinds of correlations to sample (rather 
than characterize exhaustively) to estimate the position of the quantum system in 
Hilbert space. 

Firstly, in models that include interactions with an external environment, the 
system is described fully by a density matrix rather than a pure state vector. Here, 
the mixture of pure states that combine to the reduced density matrix description 
of the system must be sampled. While this kind of sampling reduces the number 
of variables substantially (We go from some huge number to its square root), it is 
not enough to make a qualitative difference. The pure state vectors still reside in a 
space whose dimension scales exponentially with system size N . 

An example of an approach that carries out only this first kind of sampling is 
the quantum trajectories method for open systems. (Some reviews of this approach 
are e.g. the articles of ZoUer and Gardiner [21], and Plenio and Knight [22], or Wise- 
man and MilburnP^ for continuous variable measurements.). This method can be 
successful in simulations of several particles, especially when microscopic details of 
their entanglement are important — for example, simulations with up to 24 qubits 
have been reported |24j. 

^Or, more generally, "subsystems", particularly when particles are not conserved. 
^An experimentally accessible example of a macroscopic property of a "quantum" nature is the 
condensate phase at a given position in a Bose-Einstein Condensate. 
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What is really needed to attack at least mesoscopic systems, is to change the 
scaling with number of subsystems from exponential down to at least polynomial. 
This is achieved by going from the pure system states (which are in general super- 
positions of various separable system configurations) to a sample of the separable 
system configurations, (examples of such separable system configurations are: the 
set of occupations of all modes, or the set of positions of all the particles.) The 
essence of the two sampling steps can be schematically expressed as 

P — ' Ef'Lil^i') (^i'l, ^^^^ 

where the sums are over S and S' samples, and the tensor products over N subsys- 
tems, having local subsystem parameters Ck {C stands for separable Configuration). 
The pure states and 1-0^/) are not necessarily separable product states, but the 
samples ®k \ C^^)k are. In the limit of many samples [S, S' — > oo), the correspon- 
dence becomes exact. 

This second sampling is really the crucial step to allow any kind of first-principles 
quantum simulation of realistic mesoscopic models®, as the number of variables to 
keep track of now finally scales proportionally to N. 

The unavoidable price paid is loss of accuracy. Fortunately most of this price 
gets transferred onto the detailed non-bulk properties, which one is not interested 
in anyway. This occurs almost automatically, because there are just so many more 
of these detailed properties. In many cases bulk properties of quite large systems 
can be calculated with useful precision, where no simulation at all was possible with 
brute force methods. The calculation time required for a given uncertainty A now 
scales (by the central limit theorem) roughly proportionally to N/A'^, whereas for a 
brute force non-stochastic method it would be^ proportional to e^(log A)^. Clearly, 
one won't be getting more than two- or three-digit accuracy with the stochastic 
methods in most cases, but this is often sufficient for a good comparison between 
theory and experiment. 

^Obviously an enormous amount of physical prediction can be made with mean field, or other 
semiclassical methods, but by first-principles models I refer to those where no semiclassical ap- 
proximations are made. 

^The time required for multiplication operations on floating point numbers with dio « — log^g A 
digits scales as dio^ 
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1.4 Path integral Monte Carlo and static thermal 
calculations 

The most widely used approach for many-mode/body calculations is path integral 
Monte Carlo. The basis for these methods were Feynman's imaginary time path 
integrals j2S|- This method, however, is only successful for static calculations of 
thermal equilibrium states (reasons explained below). 

In the static thermal case, the un-normalized density matrix at temperature T 
is f)u{T) = exp —H/ksT , and path integrals make use of the simple (and exact) 
property that p„(T) = pu{MT)^. If M is sufficiently large that MkeT > {H), 
then one can use a high-temperature approximation^^ to evaluate p„(7WT). The 
density matrix 'pu{T) can be written as a convolution of the high temperature density 
matrix elements (in complete bases of separable system configurations C labeled by 

C(0), • • • , C(^M))- 

Pu{T) = j ... j dC(^o)dC^i) ■ ■ ■ dC^M) (1-2) 
{C^o)\pu{MT) |C(i)> (C(i)l p,(A^T) |C(2)> ■ • • {C^M-i)\pu{MT) \C^m)) • 

To obtain properties diagonal in the configuration basis chosen, one takes samples 
while setting C(o) = C(;k), and ends up sampling a "ring polymer" of configurations, 
the number of variables in a full sample scaling as AiN. The distribution of the 
samples is given by the product of the expectation values: 



P(C(o), . . . , C(^)) = ®fl, I MMT) |C(fe)> 



1.3) 



which is real positive, or largely so. Off-diagonal properties require separate calcula- 
tions with slightly different sample properties. An extensive discussion of the details 
of the path integral approach can be found e.g. in Ceperley|2Zl EH] and references 
therein. 

While, formally, this approach might appear to be also applicable for dynamical 
evolution via p(t) = p(0) exp —iHt/h , in practice this does not work. The reason 

^{Hv + HK)/kBT = 
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Typically one uses the Feynman-Kacs or Trotter formula 26' exp 



lim 



M^oo exp 



-Hv/MkeT 



exp 



M 



—Hn/AdkBT ) to evaluate the effect of the potential iJv 



and kinetic Hk terms of the Hamiltonian separately. 
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is that now P{C(j)) is primarily a product of complex phase factors, rather than real 
positive values. One obtains a very wide range of configurations, all with similar 
weighting, but mutually canceling phase factors. In the limit of infinite trajectories, 
almost all of these configurations would destructively interfere leaving just the ob- 
served dynamics, but for finite numbers of samples in a complex system there is no 
automatic way to tell which configurations are the important ones. The end result 
is that no sensible observable averages emerge from the noise even for very short 
times. 



1.5 phase-space evolution methods 

The density matrix of a system can often be written in a form 

p = j P{C)A{C)dC, (1.4) 

where C is a separable system configuration^^, containing some number of variables 
linear in N (the number of subsystems). -P(C) is a positive function, and A(C), the 
kernel, is a projector or similar tensor product operator parameterized by C. 

In a generic case where pure quantum states of each subsystem (labeled k) can be 
fully described by the set of parameters C^, a viable kernel would be A(C) = e'^ (8)^ 
\Ck)k (C'felfe. Here |Cfe)^ is the pure state of subsystem k given by parameters C^, and 
^ is a phase. The full configuration then would be C = {9, Ci, . . . , Cat, Ci, ■ ■ ■ , Cn}- 

The basic idea is that P{C) can be interpreted as a probability distribution of the 
configuration C. Then, the evolution equation for p (with time or with temperature) 
produces a corresponding evolution equation for P, and finally evolution equations 
for the configuration samples C. Depending on the system and the kernel A chosen, 
this procedure can often be carried out, and evolution equations for the configuration 
samples of the system derived. In particular, dynamical evolution can be simulated 
while the number of variables remains linear or polynomial in A^. 

In fact phase-space distribution methods are much more versatile than path 

integral approaches on a number of counts: 

^^The variables of C can be discrete or continuous. In the former case the integral over C is 
replaced by a sum. 
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1. Dynamics can be simulated. 

2. Results are given for a range of times (or temperatures) by one simulation 

3. All observables can be calculated in one simulation 

In both phase-space distribution and path integral approaches, the amount of com- 
puter effort scales linearly with system size A^. More details of how this is achieved 
are given in Chapter El See also a tabular comparison of the various quantum 
many-body/mode simulation approaches in Tables ITTT] and (TT^ 

Unfortunately, a major stumbling block for phase-space distribution methods is 
that the equations for the configuration variables in C often have stability problems, 
which can lead to large noise and/or statistical bias. This is one of the main reasons 
that these methods have not been as widely investigated nor used as path integral 
Monte Carlo, despite their other advantages. 

This thesis concerns itself with methods of overcoming this stability problem 
using the stochastic gauge technique, and some demonstration of subsequent appli- 
cations. It is also intended to show that when stability issues are taken care of, these 
phase-space distribution methods can be competitive with path integrals for static 
thermal ensemble calculations as well. 

1.6 Open systems and the mode formulation 

Actual physical systems do not exist in isolation, and there is always some degree of 
coupling to the external environment. This coupling becomes ever more pronounced 
as one moves from microscopic to mesoscopic systems, and is considered responsible 
for the reduction of quantum phenomena to classical behavior as the macroscopic 
limit is reached. 

Traditionally in most many-body studies the system has been taken to be com- 
posed of a set number of A^ particles. A different formalism is required if one is to 
investigate open systems that can exchange particles with a reservoir having chemi- 
cal potential fi, or for dynamical calculations if particles are lost or injected into the 
system. A natural choice is to consider lattice models where rather than dividing 
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Table 1.1: A comparison of different methods to exactly simulate quantum dynamics from first 
principles. As applicable to many-body/mode simulations with N subsystems (e.g. particles, 
modes). 



Variety: 


Explicit 


Quantum 


Path 


Phase-space 




Density Matrix 


Trajectories 


Integral 


Distributions 


Stochastic 


No 


Yes 


Yes 


Yes 


Sample size 




oce^ 


oc MN 


oc N 


Phase factor 










catastrophe'* 


No 


No 


Yes 


No* 


Unstable equations 










or sampling bias 


No 


No 


No 


Often^ 


Observables from 






Limited 




one calculation 


All 


All 


subset 


All 


Simulation scope 


Range of t 


Range of t 


Single t 


Range of t 



''Where samples of similar weight but different phase factors interfere, masking any physically 

meaningful results. 

''At long times some gauged schemes may develop interfering phase factors. This occurs only 
for some gauges in some systems. 

'^However, in a wide range of cases the instability can be removed using the stochastic gauge 
technique, as shown in this thesis. 

the system into N subsystems consisting of a particle, we divide the system up into 
spatial (or momentum) lattice points that can be occupied by a variable number 
of particles. The kernel A then becomes a separable tensor product of lattice-point 
states, described by some local parameters. Their total number for a system config- 
uration sample is again proportional to A^. Because we want the methods developed 
in this thesis to be applicable to systems that allow particle exchange with an en- 
vironment, this is the kind of model that we will mostly concentrate on here. The 
general methods developed are, however, by no means restricted to mode-based 
models. 

A feature to be aware of in such a model is that in general the state will be in 
a superposition or mixture of pure states with different numbers of total particles. 
This is not expected to hinder comparison with experiment, however. A subsystem 
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Table 1.2: A comparison of different methods to exactly calculate static quantum properties of 
equilibrium thermodynamic ensembles from first principles. As applicable to many-body/mode 
simulations with N subsystems (e.g. particles, modes). 



Variety: 


Explicit 


Quantum 


Path 


Phase-space 




Density Matrix 


Trajectories 


Integral 


Distributions 


Stochastic 


No 


Yes 


Yes 


Yes 


Sample size 




N 

oc e 


oc MN 


oc 


Phase factor 






For fermions" 


Can occur 


catastrophe^ 


No 


No 


keT < AEfb 


for low T 


Unstable equations 










or sampling bias 


No 


No 


No 


Often'^ 


Observables from 






Limited 




one calculation 


All 


All 


subset 


All 


Simulation scope 


Range of T 


Range of T 


Single T 


Range of T 



"For fermion calculations the "fermion sign problem" appears after fc^T < AEpB, where AEps 
is the free energy difference between the fermion system, and an analogous system of bosons. For 
a review see Schmidt and Kalos|29J 

''Where samples of similar weight but different phase factors interfere, masking any physically 
meaningful results. 

"^However, in a wide range of cases the instability can be removed using the stochastic gauge 
technique, as shown in this thesis. 



such as e.g. a trapped gas is perfectly entitled to be in a superposition of number 
states, and e.g. evaporative cooling of Bose atoms will produce BECs of different 
sizes in each individual experimental run. 

For open systems with particle transfer to the environment, the boundary where 
the system ends and the environment starts will be at best hazy, and particles will 
take some time to travel from one to the other. There will always be a gray area, 
and in it some particles, where the labeling of particles as belonging to the system or 
environment is dubious. Superpositions and/or mixtures of different particle number 
states are actually trying to reflect this labeling difficulty which is encountered in 
experiments. 

Also, in many situations the effect of such superpositions will be small: A rea- 
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sonably accurate description of a realistic mesoscopic system in most cases requires 
a fairly large number of lattice points to resolve spatial and momentum variations in 
the system. In such a case, for statistical reasons, the variation in particle number 
of the whole system will be small in comparison with the total number of particles. 

Lastly, it should also be pointed out that if desired, one can always simply 
turn off the external couplings, and obtain results for closed number-conserving 
Hamiltonian models, which are just special cases with external interaction strength 
zero. (Although a simulation method that is "hard- wired" to a conservative system 
is likely to be somewhat more efficient than an open systems method.) 
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Chapter 2 

System of choice: Open 
interacting Bose gases 

Simulations presented in this thesis mostly^ concern themselves with simulations of 
interacting Bose gases in a lattice model with two-particle interactions^. Here those 
basic details of the model that are repeatedly referred to in the body of the thesis 
are introduced. Also, this choice of mesoscopic system from the wide variety whose 
simulations could have been attempted is motivated. 

This chapter does not attempt to present a review of the state of knowledge on 
interacting Bose g 3iSGS, clS this is currently very extensive and doing so would unnec- 
essarily lengthen the thesis without introducing any additional novelty. Instead, I 
recommend to the reader the excellent reviews on both theoretical and experimental 
aspects of Bose Einstein condensation in cold alkali gases by Leggettj^, Dalfovo et 
a/[3T], Parkins and Walls[32j. Topical also are the 1998 Nobel lectures of Chu|33j. 
Cohen-Tannoudji[34j, and Phillips on laser cooling and magneto-optical trap- 
ping, as well as the reviews of evaporative cooling by Ketterle and van Druten[HT)] 
and Walraven[37J. An overview of recent developments can be found in e.g. the col- 
lection of articles in Nature Insight on ultracold matter El El El 120] • Physical 
properties of interacting Bose gases pertinent to the analysis or interpretation of the 

^Although not exclusively, see for example Chapter El 

^In the most commonly used cold alkali-metal gas models these interactions arc local as in a 
Bose-Hubbard model, and are then often referred to as "delta-function" interactions. 
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simulations in Parts B and C of this thesis are given in the relevant chapters. 

2.1 Motivation 

The choice to concentrate on cold dilute interacting Bose gas models when trying 
out the simulation methods developed in this thesis is motivated by the following 
considerations: 

• The prime motivation is that mesoscopic systems that are very well described 
by these models are experimentally accessible. These are the cold dilute alkali- 
metal gases at temperatures below or near the Bose-Einstein condensation 
(BEC) temperature. In the last several years production of cold Bose gases 
of alkali-metal atoms has become almost routine, occurring in many tens of 
labs around the world. The largest interest (see Figure l?!T|) has been in gases 
below the condensation temperature where quantum effects come to dominate 
the system even though it is mesoscopic or even approaching macroscopic in 
size. These were first realized in 1995 by Anderson et al 6\ (^"^Rb), Davis et 
a/[ZI (^^Na), and Bradley et a/jSjj (^Li), by laser cooling the gas in a trap, and 
subsequently lowering the temperature even further by evaporative cooling. 
Three years later the first signatures of BEC were seen by Fried et al 9J in the 
somewhat different case of hydrogen. Presently, many variants of these systems 
are being investigated experimentally. For example, quasi-one- dimensional 
gases have been achieved jSHl EHl EHj, and coherent four- wave mixing between 
atom clouds has been observed jHj, just to name a few of the experimental 
directions relevant to the models considered later in this thesis. 

• A related argument is that there are phenomena in the experimental systems 
that are not precisely understood and appear quite resistant to conclusive 
theoretical analysis by approximate methods. This invites a first-principles 
investigation. 

Most of these occur in situations where there is a transition or interplay be- 
tween disordered high temperature behavior well above condensation temper- 
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ature Tc, and the pure many-body quantum states approached as T — 0. The 
first can be treated with kinetic theories, while the latter is very well described 
by the Gross-Pitaevskii (GP)|121 113 Ell mean-field equations, or Bogoliubov 
theory 1^^. As usual in physical systems, the intermediate regime is the most 
complicated and is the place where first-principles simulations are the most 
needed. 

Examples of such issues where first-principles simulations can make a step 
forward to full understanding are, for example: 

1. The behavior of the system during initial condensation, when the T ^ Tc 
barrier is crossed. This includes issues such as "does the always conden- 
sate form in the ground state, or is (at least metastable) condensation 
into excited states possible?" There are some indications that conden- 
sates may occur e.g. in states of non-zero centre-of-mass mot ion 

2. Is the exact ground state of a BEG the same as that obtained by semi- 
classical GP methods? 

3. Dynamics of condensates during atom interferometry. 

4. The decoupling of atoms from the trapped condensate in an atom laser 
arrangement. How is the emitting condensate disturbed, and in what 
state are the pulses emitted? 

5. Scattering of atoms into empty momentum modes when condensates 
coUidejn]. 

6. Heating and other processes that eventually destroy the condensate in 
experiments. 

It is worth noting that most of the above processes usually involve coupling of 
the condensate to an external environment. This is further indication that a 
simulation method capable of handling open systems is desirable. 

• Not only is the field of Bose-Einstein condensation experimentally accessible, 

but the sheer amount of research on the topic has been growing (and continues 
^At finite temperatures , but significantly below Tc 
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Figure 2.1: Papers published in refereed journals (by year) on the topic of Bose- 
Einstein condensation. Based on a search for the keywords "Bose Einstein condens* " or 
"Bose condens* " through titles and abstracts in the ISI "Web of Science" citation database ( 
http://wos.isiglobalnet.coin/ ) on 11 Feb 2004. Note the rapid increase after the first successful 
BEC experiments in 1995. 

to grow) at a great rate (see Figure ITTjl . This would indicate that some of the 
predictions made may be verified rapidly. 

• Lastly, but importantly, many-mode simulations of similar systems have been 
successful with a phase-space method (the positive P distribution |1U1 [TT|). 
Some examples of this include simulations of evaporative cooling of a Bose gas 
to the beginning of condensation |14 [ IT5 \ I46j. as well as simulations of optical 
soliton propagation in nonlinear Kerr media ^21 EI, which have a similar form 
of nonlinearity to ()2.17p — quadratic in local particle density. The gauge 
distributions developed in this thesis include the positive P distribution as a 
special case, and improve on it. This previous success is an indication that 
the newer methods are likely to lead to significant results with these systems. 
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2.2 Field model with binary interactions 

Consider a P-dimensional boson field ^E'(x) undergoing two-particle interactions. 
After second quantization the Hamiltonian of the system can be written as 

H = /,-x|^^^ + K»(x)$.(x)®(x)| 

J |2m fts 9x y J y J y J j 

+ ^y"rf^xt;^yt/(x-y)$t(x)$t(y)$(x)$(y). (2.1) 

The bosons have mass m, move in D-dimensional space whose coordinates are x, 
experience an external conservative potential Vext(x), and an interparticle potential 
f/(r) = U{—r) where r is the spacing between two particles. The field operators 
^^(x) and ^(x) are creation and destruction operators on bosons at x, and obey 
the usual commutation relations 

"$(x),$t(y) 

The total number of bosons is 

N = J rf^x§t(x)§(x). (2.3) 

In a typical Bose-Einstein condensation experiment with cold trapped alkali-metal 
gases the trapping potential Kxt is parabolic, in general non-spherical. 

Three-body^ processes are not included in the model. While these can be ne- 
glected for most contemporary experiments with cold rarefied alkali-metal gases 
or BECs, they can become significant when density is high enough, or when one 
operates near a Feshbach resonanace, requiring modifications to the model. (For 
example, three-body recombination into molecules |47| EH] can play an important 
role in loss of atoms from a BEC). 

2.3 Reduction to a lattice model 

To allow computer simulations, 1)2.11) must be reduced to a lattice form. Provided the 
lattice is sufficiently fine to resolve all features, no change in physical interpretation 
occurs. 

^Strictly speaking, all > 2 body. 



(2.2) 
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One issue to keep in mind, however, is that some care must be taken when going 
from a continuum to a lattice model to make sure that the behavior of the latter 
at length scales longer than the lattice spacing is the same — i.e. that the lattice 
model has been renormalized. 

One way to do this is to impose: 

1. A spatial box size in each dimension L^, 

2. A momentum cutoff in Fourier space fc™'^^, 

where spatial dimensions have been labeled hj d = 1, . . . ,V. In this case one 
proceeds by expressing all spatially- varying functions (generically called /(x)) in 
Fourier modes as 

/w= (n^)E^^^'"^ (2.4) 

where the index n = {rii, . . . ,n-z)} has an integer component = 0, ±1, ±2, . . . 
for each dimension. The wave vector kg = {kiijii) , . . . , kx>{ni:i)} also contains V 
components 

kdijid) = 27ind/Ld. (2.5) 

This has imposed the spatial box size, and now we also impose a momentum 
cutoff k"^^^ by truncating all terms in ()2.4j) for which any \kd{nd)\ > fc™**^. That is. 



/(x) - /cut(x) = n ^ E ■ ■ ■ E /n^^'"''- (2-6) 

\ d ) |ni|<LiA:™='''/27r |ni:)|<ii:)fcg'"'/27r 

This leaves an Md point lattice in each dimension, with Md = 1 + 2int [Ldk^^^ /27r], 
when int [ ■ ] gives the integer value (rounded down) . 

The lattice points in Fourier space kd have been defined by ()2.5j) . the function 
values there are /h, while the function values in normal space /„ are given by the 
discrete inverse Fourier transform of the f^'- 

fn = /cut(x„). (2.7) 

Here the spatial lattice is Xn = {xi(ni), . . . , XT>{nx>)} given by 

Xdind) = ridAxd, (2.8) 
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with spacing Axd = Ld/Md . The index n is composed of non-negative integers 
n = {ni, . . . , riv} : = 0, 1, . . . , - 1. 

The lattice model is physically equivalent to the continuum field model provided 
that 

1. No features occur on length scales ~ Ld or greater. 

2. The truncated Fourier components (/h corresponding to any kd > k^^^) are 
negligible so that for all practical purposes /(x) = /cut(x). In other words, no 
features occur on length scales ~ Axd or smaller. 

Conversely, in cases when some processes occur entirely on length scales smaller 
than the lattice spacings Axd-, the lattice model may be renormalized in non-trivial 
ways (so that /„ ^ /(xn) ) to still be physically equivalent to the continuum model 
on lattice spacing length scales. An example is outlined in Section 12.41 

Proceeding in this manner for the case of extended interparticle interactions 
()2.1|) . and using the lattice notation of ()2.4|) . ()2.6j] . and ()2.7|) . let us define the 
lattice annihilation operators 

an = $n^nAxd, (2.9) 

which obey the boson commutation relations 

[an,ay=5nm. (2.10) 
The particle number operator at lattice point n is 

nn = aj^an. (2.11) 
With these, one obtains the expression: 

(2.12) 

In this normally ordered expression, the frequencies c^nm = ^^mn come from the 
kinetic energy and external potential. They produce a local particle number depen- 
dent energy, and linear coupling to other sites, the latter arising only from kinetic 



H 



huJr 



(5nm) 
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processes. The frequencies uOnm are explicitly 




(2.13) 



where the function K^'^'' is the P- dimensional discrete inverse Fourier transform of 
\^\' = EAl That is, 




(2.14) 



The interparticle potential can also be discretized on the lattice, however there 
are a few subtleties caused by the finite box lengths L^. Let us define the lattice 
mutual interaction strength = f4ut(xn) in the notation of ()2.7|) and ()2.6|) . At 
first glance, the lattice scattering terms might be Unm = U^-m since Xd{nd) oc n^, 
but there are two issues that complicate matters: 

1. One has only non-negative lattice labels n and coordinates Xd{nd), as seen 
from 

2. The Fourier decomposition on a box of side lengths Ld implicitly assumes 
periodic boundary conditions, so e.g. particles at lattice points n = and 
m = {Ml — 1, ... , Md — 1} are effectively in very close proximity. 

In the continuum field model the interparticle potential is symmetric {U{r) = 
f/(— r)), so to impose periodicity on length scales Ld, the lattice interparticle po- 
tential should obey [/„ = f/{A/i-ni,n2,...,ni,} = ■ ■ ■ = U{nun2,...,MT,-ni,}- This Symmetry 
then lets us write the scattering terms as 

''^nm — Umn — f^|n— m|) (2.15) 

where the notation means |n — m| = {\ni — mi| , . . . , \nxi — fnxi\}- For the lattice 
model to be equivalent to an open continuum field model also requires to be 
negligible when any Ud ^ O {Md/2). 

2.4 Locally interacting lattice model 

The majority of mesoscopic experiments with interacting Bose gases where quantum 
mechanical features have been seen use alkali-metal atoms (Li, Na, K, Rb, Cs, 
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and also H) in or near the BEC regime (see also Section \2.1^ . where a significant 
simplification of the lattice Hamiltonian ()2.12j) can be made. 

Broadly speaking, what is required is that only binary collisions be relevant, and 
that these occur on length scales much smaller than all other relevant length scales 
of the system, including the lattice spacings Axd- The simple-minded continuum to 
lattice procedure of the previous Section EHl no longer suffices, because the interpar- 
ticle interactions occur at momenta well above k^^^. Detailed consideration to this 
renormalization is given in Section IV of the aforementioned review by LeggettjHOl, 
and in even more detail by Dalibardjinj and Wiener et a/jSD]- The reasoning for 
single-species cold alkali-metal Bose gases is (in brief) as follows: 

• The leading term in inter-atomic potentials at long range > SA for the alkali- 
metal atoms is the van der Waals interaction oc l/|r|^. 

• This potential gives a van der Waals length rvdw and energy -Evdw ~ ^^/'"^'^vdw 
characteristic of the atom species, which are the typical extent and binding 
energy of the largest bound state. For the alkali-metals -Evdw is of order 0.1 — 
ImK, except for hydrogen, which has -Evdw ~ 3K. 

• The binding energy of these states is much higher than thermal energies at 
or near the BEC condensation temperature for these atoms (typically, Tc ~ 
O (lOOnK)). For atom pairs with relative orbital angular momentum /rei 7^ 0, 
the scattering strength is smaller than s-wave scattering (Zrei = 0) by a factor 
of ikuT I E.rA..,')'-^''^ [HH] . Conclusion: for fc^T ^ -Evdw only s-wave scattering 
interactions need be considered. 

• For van der Waals potentials, s-wave scattering at low energies can be charac- 
terized by a single parameter, the s-wave scattering length (see e.g. Landau 
and Lifshitzjni], Sec. 108). This is typically of similar size to r^dw (e.g. for 
^''Rb, as = 5.77nm ^3), and can be considered to be roughly the radius scale 
of an "equivalent" hard sphere (For indistinguishable bosons, the scattering 
cross-section is Svra^). 

• If the s-wave scattering length is much smaller than all other relevant length 
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scales of the system, then the exact inter-atom interaction can be replaced 
by a local lattice interaction where Unm oc 5nm in the formalism of ()2.12|) . 
Relevant length scales include the de Broglie wavelength of the fastest atoms, 
the interparticle spacing, the trap size, the transverse thickness (in the case of 
2D or ID systems), and the lattice spacing Axd- 

Following this broad line of argument, it can be shown EUl EOj that for two 
indistinguishable bosons in 3D, the scale-independent couphng constant g is given 
by 

9 = • 2.16 

m 

This then leads to a lattice Hamiltonian of the form 

H = ^hWnrnaiarn + hx^UniUn- I), (2.17) 
nm n 

with the (scale-dependent) lattice self-interaction strength being 

In the case of an effectively one- or two-dimensional gas of indistinguishable 
bosons, the couphng constant is instead given by 

where Aq is, respectively, the effective thickness or cross-section in the transverse 
(collapsed) dimensions. 



2.5 Open dynamic equations of motion 

For a wide range of physical situations the interaction of an open system with its 
environment can be considered Markovian, and in this situation can be modeled 
with a master equation, where the evolution of the density matrix is dependent only 
on its present value, with no time lag effects. Broadly speaking, this means that 
any information about the system received by the environment via interactions is 
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dissipated much faster than the system dynamics we perceive. Hence, no feedback 
into the system occurs. 

Under these conditions the equation of motion for the density matrix of the 
system can be written (in Linblad form) as: 



dp _ 1 
dt ih 



The Linblad operators Lj represent various couphng processes to the environment, 
and their number depends on the details of the system-environment interactions. 
Note that p is the reduced interaction picture density matrix of the (Bose gas + 
environment) system, traced over the environment. Some examples of common 
processes are: 

Single-particle losses to a standard^ zero temperature heat bath with rates 7n at 

Xn: 

2„ = anv^. (2.21) 

Exchange with a finite temperature heat bath with coupling strength propor- 
tional to 7n at Xn: 

Ln = anA/7n(l +^bath), 

^1 -^i — (2.22) 

= aW7n^bath, 

Where the sum in fl2.20|) is taken over both terms in and L'^. For the standard 
heat bath with energy ^c^bath per particle for all modes, rlbath, the mean number of 
particles per mode, is given by the Bose-Einstein distribution expression 

^bath = l/[exp{hwbs,th/kBT) - 1], (2.23) 

and hence for a zero temperature bath rlbath = 0. 

Two-particle (at a time) losses to the standard zero temperature heat bath[TT] 
with rate 7^^'* at Xn: 



Ln = alJ^^^\ (2.24) 



^The Hamiltonian of the heat bath is taken to he H = ftwbath J2j (^j^i + with bj being the 
annihilation operator for the jth heat bath subsystem. 
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Coherent gain from a driving field at Xn can be modeled by adding a term of 
the form 

H, = ihJ2[^nai-sla^] (2.25) 

n 

to the Hamiltonian. 

Details of the derivation of the loss/gain expressions (like those above, or more 
general) can be found e.g. in Gardiner [HHj. Chapter 10, or Louisell|Sll ES]- 



2.6 Thermodynamic equations of motion 

The (un-normalized) density matrix of a grand canonical ensemble in contact with 
a bath at temperature T and with chemical potential fi is given by 



Pu = exp 



-{H-fiN)/kBT 



(2.26) 



In the second expression the "imaginary time"^ r = l/ksT and the "Kamiltonian" 
K have been introduced. 

The rate of change of with r can be written as'' 



dpu 
dr 



dKr 
~d7 



,Pu 



dKr 



dr 



-pu, 



(2.27) 



provided that 



dK 
97' 



K 



0. 



(2.28) 



For the Hamiltonian here ( ()2.12|1 or ()2.17|1 ). the equation of motion can be 
written 



dr 



Pe{r)N - H 



pu, 



(2.29) 



where the "effective" chemical potential is 

d[rp{r)] 



Pe[r) 



dr 



(2.30) 



^So called because of the similarity of the left equality of H2.27|l to H2.2()|l with t replaced by 
ihT/2, and Lj = 0. 



'^[A,B]+ = AB + BA. 
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The condition ()2.28|1 implies, in this case, that only the chemical potential /i(T) can 
be temperature dependent^ 

The utility of this equation stems from the fact that the grand canonical ensemble 
at r = (i.e. high temperature T oo) is known, and given by the simple 
expression 



Pu{0) = exp 

where 



-XnN 



(2.31) 



Xn = - lim [r/i(r)] , (2.32) 

and the initial mean number of particles per lattice site is 

uq = [e^- -ly^ ■ (2.33) 

This leads to initial conditions for the stochastic simulation that can be efficiently 
sampled in most cases. Finite temperature results are then obtained by evolving 
the system forward in r. 



^Strictly speaking, a constant (in space) external potential V°^*' may also be temperature de- 
pendent, but this is physically equivalent to a correction to /i. Quantities such as g and V°^*{x) 
must be constant with T for (|2.29|) to hold. 
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Chapter 3 

Generalized quantum phase-space 
representations 

3.1 Introduction 

In this chapter a formahsm encompassing very general phase-space distributions 
describing quantum states and their evolution is presented. The correspondence 
made is between a quantum density operator and a distribution of operators, or 
"kernels", as in ()1.4|1 . The motivation for this is to investigate what distribution 
properties are essential for tractable mesoscopic first-principles simulations, and to 
provide a systematic framework in which comparison between methods is simplified. 
To this end, the following two conditions on the distribution will be kept in mind 
throughout: 

• Exact correspondence: A statistical sample of the kernel operators chosen 
according to the phase-space distribution must approach the exact quantum 
density matrix in the limit of infinite samples. This property must be main- 
tained during evolution via stochastic equations. 

• Variable number linear in system size: The number of variables needed to 
specify a kernel (and hence the number of stochastic equations needed to model 
quantum evolution) must scale linearly with A^, the number of subsystems. 
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The formalism is introduced in Section ESJ and correspondence between density 
matrix and distribution considered. Subsequently, it is shown how the density matrix 
corresponds to a statistical sample of kernel operators in Section 13.31 while in 13.41 
it is shown how the quantum evolution corresponds to a set of stochastic equations 
for these. Along the way, examples are given using the positive P representation, 
commonly used in quantum optics. The resulting requirements on the operator 
kernels for such a scheme are summarized in Section 13.61 

It is also shown that the phase-space distribution methods have two generic 
properties that are convenient for practical simulations: 

• Parallel sample evolution: The individual stochastic realizations of the 
kernel operator (which are later to be averaged to obtain observables) evolve 
independently of each other. This allows straightforward and efficient parallel 
computation (more detail in Section 

• All observables in one simulation: One algorithm is capable of giving 
estimates for any/all observable averages (see Section 13. 3. 

Some previous generalizations of phase-space representations such as generalized 
P representations [TT| . and the (already quite general) discussion of phase-space rep- 
resentations in the article by Drummond and Deuar|nni, are contained as special 
cases of the discussion in this chapter. 

3.2 Representation of a density matrix 

We expand the density matrix 



as in ()1.4|) . where C is a set of configuration variables specifying the kernel operator 
A(C). The idea is that if P is real and positive, it can be considered a probability 
distribution, and we then can approximate the density matrix using S samples A(C). 
Each such sample is fully defined by its set of configuration parameters C. 




(3.1) 
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3.2.1 Properties of the distribution 

To be able to interpret the function P{C) as a probability distribution, we require 
P to be 

• Real. 

• non-negative: p > 0. 

• NORMALIZABLE: i.e. / P{C)dC converges to a finite value. 

• NoN-siNGULAR. The primary reason why a singular distribution is a problem 
is that it may be incapable of being sampled in an unbiased way by a finite 
number of samples. For example, if one initially has a non-singular distribution 
P{t) but singularities arise through some dynamical process after a time tsing? 
then a set of samples that were unbiased estimators of P{t) initially {t <^ ^sing), 
will generally be incapable of sampling the singularities at t > tsing correctly. 

Some singular behavior can, however, be tolerated in initial distributions (e.g. 
-^l"^) — Sj<5'^('^ ~ '^o"''') some real variable v) provided the number of 
singularities is finite and much less than the number of samples S. This can 
even be desirable to achieve a starting set of samples that is compact. 

If these conditions are satisfied, then when each sample (labeled by j) is defined 
by its own configuration parameters C^^\ 



Many well-known distributions of the general form ()3.H1 do not satisfy these 
conditions for general quantum states. For example the Wigner distribution |57j 
is commonly negative in some regions of phase space when the system exhibits 
nonclassical statistics, the complex P distribution [TTj is not real, and the Glauber- 
Sudarshan P distribution jSHl EH] can be singulariGQ^ (also when nonclassical statistics 
are present). These distributions are often very useful, but not for many-body 
stochastic simulations. 



s 




(3.2) 
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3.2.2 Splitting into subsystems 

A mesoscopic system will consist of some number of subsystems. In this thesis 
the subsystems are usually spatial or momentum modes, although other common 
approaches are to label as subsystems individual particles (if their number is con- 
served), or orbitals. 

A crucial aim, as pointed out at the beginning of this chapter and in Section [T!^ 
is to have system configurations C specifying each sample contain a number of 
parameters (variables) that is only linear in A^. Otherwise any calculations with 
macro- or mesoscopic A^ will become intractable due to the sheer number of variables. 
Hence, the kernel A should be a separable tensor product of subsystem operators^: 



Here the local kernel for each kth subsystem is described by its own set of local con- 
figuration variables Ck- There may also be some additional "global" variables (of 
number -C O (N)) in the set Cgiob affecting global factors /giob(Cgiob) or several sub- 
systems. The full configuration variable set for the kernel is C = {Cgiob, Ci, ■ ■ ■ , Cm}- 
A separate issue altogether is the choice of basis (i.e. the choice of A^) for 
each subsystem. This chapter aims to stay general, and choosing A^ is deferred to 
Chapter El and later, apart from discussing general features of the As necessary for 
a successful simulation. 

3.2.3 Dual configuration space of ofF-diagonal kernels 

Here, arguments will be put forward that for non-trivial simulations the local kernels 
Afc are best chosen to include off-diagonal operators. 

A density matrix can always be diagonalized in some orthogonal basis, however 
there are some basic reasons why such a basis is not usually suitable for many-mode 
simulations of the type considered here. 

1. Firstly, a general quantum state will contain entanglement between subsys- 
tems, which precludes writing it as a distribution in separable form ()3.3j) with 

^Strictly speaking, A could also be a finite sum of N) tensor products of subsystem operators. 
This may be useful in some situations. 




(3.3) 
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only diagonal kernels. 

2. Secondly, while for some situations the state will, after all, be separable, and 
could be written using only locally diagonal kernel operators, generally the 
basis that diagonalizes the density matrix will change with time in a nontrivial 
way. The kernels, on the other hand, do not change. This means that while 
the initial separable state could, in this case, be sampled with diagonal local 
kernels, the subsequent exact quantum evolution could not be simulated using 
those kernels. 

3. Thirdly, exceptions to the above arguments occur if one has a system that re- 
mains separable while it evolves. Alternatively, if its inseparability has simple 
time-evolution that could be found exactly by other means, then this time- 
evolution could be hardwired into a time- dependent kernel A{t) in such a way 
that ()3.3|1 continues to hold with diagonal (now-time-dependent) local kernels. 
In such a case, however, there is no point in carrying out time-consuming 
stochastic simulations of the whole system, when one could just investigate 
each subsystem separately. 

Still, the above arguments are not a rigorous proof, primarily because non- 
orthogonal basis sets have not been considered. However, no diagonal distribution 
that can be used to represent completely general quantum states is presently known. 
For example the Glauber-Sudarshan P distribution jSHl EHj, which has as projec- 
tors onto local coherent states (which are non-orthogonal), is defined for all quantum 
states, but it has been shown that when some nonclassical states are present, the 
distribution P becomes singular [UU]. and not amenable to unbiased stochastic sim- 
ulations. 

In summary, it appears that to allow for off-diagonal entangling coherences be- 
tween subsystems, the local kernels should be of an off-diagonal form 



where the and are subsets of different independent parameters, and the local 
parameter set for the kernel is Ck = {C{., C{.}. 




(3.4) 
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Finally, an exception to this off-diagonal conjecture might be the dynamics of 
closed systems — which remain as pure states for the whole duration of a simulation. 
In such a case, one might try to expand the state vector (rather than the density 
matrix) {ip) directly, along the lines of 

= j P^C^, 6) e'' \C^) dC^dd. (3.5) 

The global phase factor e*^ is required to allow for superpositions with a real positive 
-Pi/)- 



3.2.4 An example: the positive P distribution 

So that the discussion does not become too opaque, let us make a connection to how 
this looks in a concrete example, the positive P distribution ITT]. This distribution 
has widely used with success in quantum optics, and with Bose atoms as well |15| IM] 
in mode-based calculations. It also forms the basis of the gauge P distribution, 
which will be explained and investigated in Chapter and used throughout this 
thesis. The positive P local kernel at a lattice point (spatial mode — of which there 
are N) is 



^fc = (/^fclfesxp + +-|/3fc|J, (3.6) 

where the states l^fc);, are normalized coherent states at the A;th lattice point with 
amplitude a^, their form given by 

" 1, 



= exp 



|0).. (3.7) 



Note: the subscript k on state vectors indicates that they are local to the fcth 
subsystem, and |0)^ is the vacuum. The global function is just /giob(Cgiob) = 1, and 
the entire kernel can be written in terms of vectors of parameters a = {ai, . . . , oat} 
using 

l«) = ®f=i \(^k), (3.8) 



as 



A = |a) exp ( -a ■ /3 + ^\<x\^ + ^\f3\^ ] . (3.9) 
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The sets of parameters are Cgiob = {},Ck = {ak,Pk}- The kernel is in "dual" 
off-diagonal operator space, and it has been shown [TT| that any density matrix 
can be written with real, positive P{C) using this kernel. It has also been shown 
constructively that a non-singular positive P distribution exists, and is given by 



P+(a,/3) 



(4 



TT 



2W 



-l«-/3T 



a + 13* 



a + (3* 



(3.10) 



There are four real variables per lattice point. 



3.3 Stochastic interpretation of the distribution 

To efficiently sample the quantum state we make two correspondences. Firstly, as 
discussed in Section 13. 2| the distribution P{C) corresponds to the density matrix 
p. The second correspondence, as per ()3.2|1 . is between the distribution P{C) that 
exists in some high- dimensional space, and S samples C^^^ distributed according to 
P{C). 



3.3.1 Calculating observables 

Quantum mechanics concerns itself with calculating expectation values of observ- 
ables, so the equivalence between it and the stochastic equations for the variables 
in C rests solely on obtaining the same evolution of observable averages. 

Suppose one wishes to calculate the expectation value of observable O given a set 
of S operator samples {A{C^^^)}, with j = 1, . . . ,S. (Operationally, one actually has 
the set of S operator parameter sets {C^-'-'}.) For a normalized density matrix, the 
expectation value is (O) = Tr Op , however e.g. in the thermodynamic evolution 
of Section 12.61 one has un- normalized density matrices, in which case 



Tr 



{O) 



Opu 



Tr [pu] 



(3.11: 



Using the representation ()3.1|1 this leads to 



/P(C)Tr ^ 


OA(C) 


dC 


/P(C)Tr 




dC 



(3.12) 
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We can make use of the Hermitian nature of both density matrices and observables 
to get extra use out of non-Hermitian (off-diagonal) kernels, giving 



/ PiC) (Tr 


6a(c) 


+ Tr 


OAt(C*) 


) dC 


/P(C)Tr 


A(C) +At(C*) 


dC 



(3.13) 



Actually, one could impose hermiticity on the kernel at the representation level via 
A ^ i(A + A"!"), but this can complicate the resulting stochastic equations for C — 
so, let us impose this only at the level of the observable moments. 

When samples are taken P{C) is interpreted as a probability distribution, and, 
lastly, noting that the expectation values of observables are real (since O is Hermi- 
tian), one obtains that the expectation value of an arbitrary observable O can be 
estimated from the samples by the quantity 



O 



where stochastic averages over samples are indicated by (■) 
is 



(Re jxr [oA 


}) + 

J 1 stoch 


l^Re jxr 


OAt 


^\ ^ stoch 


2 


{^Re |Tr 


A 


^\ ^ stoch 



stoch' 



(3.14) 



The correspondence^ 



lim O = (O). 



(3.15) 



3.3.2 Assessing estimate accuracy 

One expects that the accuracy of the averages in the ()3.14j) expression using S 
samples will improve as ^/S via the Central Limit Theorem (CLT), since they are 
just normalized sums over S terms. The uncertainty in a mean value v = {v)^^^^^ 
calculated with S samples can then be estimated by 



Av 



^')stoch - (^) 



stoch 



s 



(3.16) 



at the one a confidence level. 

There is, however, a subtlety when several averages over the same variables 
C are combined as in ()3.14|1 . because the quantities averaged may be correlated. 



^Provided there are no boundary term errors (see Chapter 1^. 
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Then the accuracy estimate ()3.16|) may be either too large or too small. A way 
to overcome this is subensemble averaging. While the best estimate O is still 
calculated from the full ensemble S as in ()3.14|) . the S samples are also binned 
into Se subensembles with s samples in each {S = sSe)- The s elements of each 
subensemble are used as in (I3.14j) to obtain independent estimates of the observable 
average {0^^\ 0^'^\ . . . , O^'^'^^}, distributed around O. One has (approximately) 



Given this, the CLT can be applied to the O to estimate the uncertainty in the 
observable estimate at the one a level: 



A practical issue to keep in mind is that both the number of samples in a 
subensemble s and the number of subensembles themselves should be large enough 
so that both: 1) The subensemble estimates O^''^ are reasonably close to the accurate 
value O ~ (O) so that ()3.17|) is true, and also 2) that there is enough of them that 
the right hand side of ()3.17p has an approximately Gaussian distribution, and the 
CLT can be applied to obtain ()3.18j) . 

An issue that arises when O involves a quotient of random variable averages as 
in ()3.14|) . is that the subensemble size s should be large enough that the denomi- 
nator is far from zero for all subensembles. Otherwise subensembles for which this 
denominator is close to zero have an inordinate importance in the final estimate of 
the mean. More details in Appendix O 

3.3.3 Calculating non-static observables 

The observable estimate ()3.14|) of Section 13.3.11 implicitly assumes that the explicit 
form of O is known a priori. When comparing with experiment, these are usually 
all the observable averages one needs. 

Nevertheless, in many theoretical works some other observables that do not fit 
this mould are considered. Perhaps the most common example of these is the mutual 




1 



(3.17) 



3 




(3.18) 
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fidelity between two states pi and p2 



Tr [piuhu] 



(3.19) 



Tr [piu] Tr [p 

2u\ 

which lies between zero (for two orthogonal pure states) and unity for pure pi = p2- 
A commonly considered special case is the purity -Fo(p) = F{p, p). 

A difficulty with evaluating such quantities as -Fo(p) or F(pi,p2) on states cal- 
culated with stochastic methods is that the density matrices are not known exactly, 
only their estimates in the form of S kernel samples. However, one can still proceed 
by expanding using ()3.1|) as pj = J Pj{Cj)Aj{Cj)dCj {j = 1,2) to give 



F{pi,P2) 



/Pi(Ci)P2(C2)Tr 


Ai(Ci)A2(C2)" 


dCidC 


2 


/A(Ci)Tr 


Ai(Ci) 


dCi J P2(C2)Tr 


MC2) 


dC2 



(3.20) 



(Note that the kernels (Ai and A2, respectively) used for representing the two density 
matrices do not have to be of the same form. The utility of doing so, however, 
depends entirely on whether the trace of the product of the two kinds of kernels can 
be evaluated in closed form for arbitrary sets of parameters Ci and C2.) 

As before, off-diagonal kernels can be used twice due to the hermiticity of density 
matrices, giving 



/Pi(Ci)P2(C2)Tr 




+ Al(Q))(A2(C2 




dCidC2 






dCiJP2{C2)TT 


A2(C2)+At(Q)" 


dC2 



F{pi,P2) -- 

jPl{C^)TT ^Ai(Ci) + A1(Q)J dC, j P2(C2)Tr ^A2(C2) + A^(C2* 

(3.21) 

The Pi(Ci) and P2(C2) can be interpreted as probability distributions of indepen- 
dently reafeed configuration samples. Denoting the first set of Si samples as {Ci^^}, 
and the second set of S2 as {Clf^^}, one obtains an estimate of the mutual fidehty 
between the two states represented by those two sets of samples as 



(3.22) 





Tr 


Ai(Cp^)) { 


A2(C?'^))+At([C?'^)]*)}" 


} 


2 E,, Re 


[Tr 


Ai(Cp^))" 




[Tr 


Mc^2''^) 


} 



(remembering that observable averages are real), with the large sample limit 



lim F = F{pi,p2). 

61-^00,62-^00 



(3.23) 
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Note that the sums in ()3.22j) are carried out over all pairs of samples Cp^'' and Cg"'^^ 

If the two states whose mutual fidelity is to be calculated arise in the same 
simulation, then one can ensure independence of the two sets of samples labeled by 
ji and j2 by setting aside half of all simulated samples for the set {C[^^^}, the other 
for {€2^^}. This separation of samples is always necessary if one wants to estimate 
purity Fq. 

Uncertainty estimates can be obtained in a similar sub ensemble averaging man- 
ner as outlined in Section I3.3.2| by working out ()3.22|1 for each subensemble, then 
using the CLT to estimate uncertainty in the mean of the subensemble estimates, 
by the same procedure as in ()3.18p . 

Finally, a major disadvantage of trying to work out fidelity estimates in the above 
manner from such stochastic simulations is that one must keep all the trajectories 
in computer memory, so that after all samples have been produced they can be 
combined in all possible pairs to evaluate the quantities Tr Ai(Cp'))A2(C^^'^) . 
Compared to a simulation that only considers static observables as per fl3.14|) . this 
increases the space required by a factor of S. 



3.3.4 Overcomplete vs. orthogonal bases 

Consider observable calculations when using local orthogonal bases for the local 
kernel operators A^. In a mode formulation, this would imply that the local param- 
eters Ck consist of discrete quantum numbers for the kth mode (e.g. occupation), 
while for a particle formulation could, for example, consist of (continuous) po- 
sitions of the kth particle, since all position eigenstates are orthogonal. Denoting 
this basis as |C{.)^, and remembering that off-diagonal kernels should be used (see 
Section in.2.3j) . a typical local kernel (omitting global parameters Cgiob) will have the 
form Ajt = \C'i^)f^ (C^|fc as in ()3.4|) . and the local parameter set is = {C{., C{.}. 

Suppose one wants to calculate the expectation value of a local observable O = 
Ok^k'jtklk' that is diagonal^ in this |C^)^ basis. The observable estimate expression 

^Or that contains some components diagonal in the basis |C'^)j.- 
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()3.14|1 is proportional to averages of quantities like 



Tr 



OA 



Tr 



OkAk 



/globl^glob) Y\_ '^^ [^^ 
k'^k 



(3.24) 



If any one of the local kernel samples A^/ or A^ are non-diagonal, then the whole 
A^-subsystem sample contributes nothing to the observable estimate, even if the 
off-diagonal kernel samples are for a different subsystem than the one in which the 
local observable is being considered (k). In a large system, practically all samples 
are likely to have at least one subsystem in which the local kernel is non- diagonal^. 
For example, if a proportion pod < 1 of samples of each subsystem are off-diagonal 
then only a proportion (1 —pod)^ will contribute to observable estimates. The result 
— no reasonable estimate of observables despite lots of calculation. Similar effects 
occur for off-diagonal local observables (in this case for a sample to contribute all 
k' k subsystems must have diagonal kernel samples, while the kth subsystem 
sample must have diagonal OfcA^.), and for observables O = ®jOj spanning several 
subsystems (in this case Ojkj must be diagonal for all j). 

A related issue arises in path integral methods (based on particle positions, for 
example), and it is the reason that the M. configurations in one "sample" must form a 
ring polymer structure (C(o) = C(^m)) if o^is wants to calculate diagonal observables. 
Off-diagonal observables (e.g. momentum distributions in position eigenstate bases) 
require a separate simulation, which contains some open polymer structures. 

This inefficient sampling is the main reason why simulations based on particle 
positions, or occupation numbers at lattice sites, are not usually successful in phase- 
space distribution based methods. In both cases the position eigenstates or Fock 
occupation number states used are orthogonal. 

To deal with this sampling problem, one can use overcomplete basis sets (in 
which the basis vectors are not orthogonal) for the description of the subsystems. 
Then the local off-diagonal kernels A^ can usually be explicitly normalized so that 



Tr 



A, 



1. The traces averaged in ()3.14|) become simply 



Tr 



OA 



Tr 



/glob(Oglob), 



(3.25) 



Apart from some special cases, of course. 
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and only local kernel parameters Ck are relevant for the calculation of local ob- 
servables. (Possibly apart from some global factors, which do not lead to sample 
contributions decaying exponentially with A^). 

The majority of this thesis considers methods based on coherent state expan- 
sions, which are of this kind. A non-orthogonal approach for particle-conserving 
systems that does use a position wavefunction is to write the density matrix as 
a distribution over off-diagonal projectors onto orbitals (with arbitrary wavefunc- 
tions, not necessarily orthogonal) occupied by all particles, as recently developed 
by Carusotto et a/[l]. 

Overcomplete expansions go a long way towards working around the problem, 
but not all non-orthogonal kernels that one might want to use can be normalized. A 
case in point are kernels that have zero trace for a set of parameters of measure zero. 
In such a case one should definitely not normalize (at least not fully) as boundary 
term errors tend to result (see Chapter A partial normalization that avoids 
any singularities in the resulting kernel (potentially caused by normalizing a kernel 
whose trace tends to zero) may be successful in such cases. However, the inefficient 
sampling can then recur. The reason is that there is then an effective "overlap" 
range in configuration space, such that if the "ket" and "bra" parameters C{, and 

differ by more than this range, the resulting kernel will have minimal weight in 
observable averages in comparison with more diagonal kernels. If the phase-space 
distributions of and have spreads significantly wider than the overlap range, 
a proportion ~ C^(l) of samples again have largely zero weight in moment 
calculations, leading again to only ^ (1 — Pod)^ significant samples overall. This 
situation can be additionally insidious because the weight of the samples is never 
exactly zero. This leads superficially to finite averages in the observable expression 
()3.14|) . however if 5(1 — Pod)^ ^ 1 there will be rare "very diagonal" configurations 
that are not sampled at all, but contribute a great deal to the averages in the limit 
S ^ oo. Conclusion: Care must be taken with un-normalized kernels to make sure 
no sampling bias is introduced. 

Summarizing, barring possible developments that may be able to work around 
the sampling problem discussed in this section, it appears to be necessary to use 
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overcomplete basis sets for the local kernels to avoid wasting a lot of computing 
resources on samples that do not end up contributing to observable estimates. It 
also appears desirable to make the trace of the local kernels normalized to unity if 
possible. 



3.3.5 Positive P distribution example continued 

Returning to the positive P distribution example of Section K12.4^ all operators of 
supported states can be written as linear combinations of the moments of the local 
annihilation and creation operators a^, a|.. Thus, to evaluate any observable it 
suffices to know how to evaluate an expectation value of a general product of the 
form ®ja^^, ®kO^L'^. (The Lj and L'f^ are subsystem labels (not necessarily unique), 
while the j and k are "subsystem label counters" ) . 

The coherent states forming the overcomplete kernels are eigenstates of the an- 
nihilation operators ak- 

Ofc = a/c , (3.26) 

and kernels are normalized: 



Tr 



1. (3.27) 



Now the same procedure as in Section 13.3.11 for ()3.14j) if followed, omitting only 
taking the real part of the numerator and denominator because ®a} a is (for 
convenience) not necessarily a strictly Hermitian observable. This leads to the ex- 
pectation value estimate 

[o.Zo) 



\ J J k / stoch 

Hermitian observables are constructed by combining the operator products and 
stochastic averages of ()3.28p with their adjoints and complex conjugates, respec- 
tively. 
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For the fidelity calculations of Section 13 .3 ..^^ one finds 



Tr 



A(ai,/3i)A(a2,/32) 



Tr 
Tr 



Afc(aifc, Afc)Al(tt2fc>/?2fc) 



J] Tr [Afc(aifc, Pik)M(^2k, P2k)] (3.29a) 

k 

exp[-{aik - a2k){/3ik - P2k)] (3.29b) 

exp [-{aik - f3*2kmk - a;,)] (3.29c) 



for two kernel samples described by variables aik, Pik and a2k, P2k respecively. 
The coherent state basis is non-orthogonal: 



(/3*|a) = e"'3-il"l'-*l^" 



(3.30) 



and overcomplete: 



|q;)(q;| d'^a = In. 



(3.31) 



(Note: the notation (Pv = dKe {v} dim {v} is used. 



3.3.6 Non-uniqueness of distributions 

When kernels with distinct parameters are non-orthogonal, as is the case when using 
local overcomplete basis sets, a given quantum state p can be described by a whole 
family of different distributions P{C). 

A simple example is the vacuum state with the aforementioned positive P dis- 
tribution. Any density matrix that satisfies 



(0|p|0) = Tr 



(3.32) 



must be the vacuum state |0) (0|, since ()3.32p specifies that the population of any 
non-vacuum states must be zero. Expanding as in ()3.1|) gives 



P{C){0\^{C)\Q)dC = PiC)TT A{C) 



dC, 



(3.33) 



which for the positive P distribution is (using J P{C)dC = 1) 



J P+(a,/3)e-"-^d2iVQ:rf2A^/3= 1, 



(3.34) 
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A class of P distributions that satisfy this are Gaussian ensembles of ak = Pk for 
each subsystem k: 

P+{cx,(3) = ]\^Pk{ak,l3k), 
P+k{ak,Pk) = S\ak-Pk){^)e'\^^\'/'-l 

for arbitrary positive real standard deviations cr = {ai, . . . , ctat}. This can be 
checked by substitution into ()3.34p . The most compact such distribution is of course 
in the limit of cr 0, when 

P+(a,/3) = 52^(01)52^ (3.35) 

One consequence of this is that the variables ak and (3k do not necessarily corre- 
spond to any physical observable. For example the "coherent state amplitudes" ak 
may be highly nonzero, despite representing the vacuum. Only when they appear 
in the combinations allowed by ()3.14|) is there a physical meaning. As an example, 
the estimator of occupation number (rik) is given, from ()3.14|1 . by 

fik = {^e {akPk}), toch 

= j P+ia, f3) Re {atPk} d^^'a d^'^fS 

= J P+kiak, Pk)'Re{akPk} d'^akd^ Pk 

■l"^l'/2'^^(ReK}2-ImK}2)rf2a, 

= , (3.36) 

the last line following from Re {ak} ^ Im {ak} symmetry. In practice, however, it is 
not irrelevant which distribution ()3.35|1 is used, because the low-cr distributions are 
much more compact and thus give much better accuracy when calculating observable 
estimates from a finite sample. 

Generalizing, some consequences of the non-uniqueness of distributions based on 
overcomplete basis sets are expected to be: 

• The variables in the sampled set C will not necessarily correspond to any 
physical observables. Only when they appear in the observable combinations 



oc / e-l^ 
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allowed by ()3.14|) is there a physical meaning. Because of this, such distribu- 
tions are then often said to be in a "non-classical" phase space. (Also because 
of the number of variables (at least 4) specifying a local off-diagonal kernel 
(classically only two - e.g. position and momentum) ). 

• The same state may be represented by many different distributions of the vari- 
ables. However, some of these will give more precise estimates of observables 
than others^. 

• Some distributions representing the same state may be narrower than others in 
certain directions of phase space, while broader in other directions. This means 
that a distribution may be better than another for estimating one observable, 
but worse for a different one. 

• Such differences in practical efficiency open the way for various optimizations, 
which here will be called stochastic gauges. See Chapters IH and [3 

• It is generally more difficult to obtain an explicit expression for P{C) given an 
initial state. If the kernel is orthogonal in a basis |C"), thus A(C = {C, C'}) = 
\C') {C'\, obtaining the unique -P(p) is easy: P{C) = {C'\p\C'). In an over- 
complete basis, one can obtain expressions such as but these are usually 
only one of many possible. 

3.4 Equations of motion 

To simulate the evolution of the state one must make correspondences to the quan- 
tum master equation^, analogous to those made for the density operator in the 
previous sections. The master equation results in a differential equation'' for P{C). 
This can then often be made to correspond to stochastic equations for the configu- 
ration variables C, which specify the evolution of the S initial samples. 
^Given the same number of samples. 

^Or, possibly different non-Markovian kinds of quantum evolution equations. In this thesis, 
however, only master equations are considered. 

partial differential equation if C is continuous, or a set of coupled (possibly partial) differ- 
ential equations for the various probabilities P{C) if C is discrete. 
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Now, while some equation for P{C) can always be obtained, in its raw form it 
is not very useful, because solving it directly for P{C, t) may be as (or even more) 
computationally intensive as solving the master equation directly. For example for 
continuous variables, one would try to evaluate the evolution of P{C) on a lattice of 
(say) Miatt points per variable. Since the number of variables in the set C is linear 
in A^, the number of lattice points on which to evaluate the evolution of P{C) would 
scale as M^^^^ — exponential in A^, just like a brute force density matrix element 
approach. 

One needs to make a correspondence between the evolution of P{C) and stochas- 
tic evolution of C, its samples. This is not in general a trivial (or perhaps even a 
feasible) exercise, but when P{C) obeys one of the broad class of Fokker-Planck 
second order partial differential equations (FPE), there is a well-known "textbook" 
method inSl in2] to obtain stochastic equations for C. This is the approach that will 
be henceforth considered here. 

It is worth noting that there may also be ways of obtaining stochastic equations 
for the configuration variables C for different kinds of equations. One direction 
that has been recently investigated by Olsen et al f)3| are differential equations for 
P{C) involving third order partial derivative terms. The resulting equations have 
stochastic terms with different statistics than the Wiener increments that will be 
used here. The results are promising but there appear to be serious numerical 
stability problems. 



3.4.1 Master equation to Fokker-Planck equation 

Let us investigate in more detail what exactly is required to obtain an FPE for P. 
These are of the general form 

j J J 

A point on notation: here, the indices j, k will label all system configuration variables 
Cj consecutively, not implying any relationship between Cj and a jth subsystem. 
The first (rather trivial, but significant) comment is that since an FPE has terms 
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of the differential form dP{C)/dCj, the system variables Cj should all be continuous 
so that partial derivatives can be defined. 

Additionally, not all combinations of master equations and operator kernels can 
lead to Fokker-Planck equations. Let us investigate what the exact requirements 
are. All master equations consist of terms % linear in the density operator, and can 
be written as 



dt 



(3.38) 



with the Ti composed of combinations of system operators Aj and linear in^ p. When 
the kernel depends on continuous parameters, such system operators and the kernel 
can often be found to satisfy mutual second-order differential identities of the forms 



AjA{C) 



k kl 

k ^ kl HI 



A(C) (3.39a) 
A(C) (3.39b) 



where all differential operators are henceforth defined to act on the right, (a well- 
known example is in ()3.57j) for the positive P representation.) If, however, higher 
than second order differential terms must also be included, no FPE can be obtained. 

Assuming ()3.39|) hold, by substitution of them into the forms of the Ti[{Aj},^), 
differential expressions for ^^[A] can be found (note that now Ti acts on the kernel 
rather than the density matrix itself). Provided that the master equation does not 
contain terms of too high order in operators for which the differential terms are 
nonzero, then the operation of each term on the kernel will be able to (after some 
algebra) be written in a form: 



HA' 



Ti'\C) + Y.Til\Cf)4, + \Y.TU(C) 

jk 



3 



d_ d_ 

dCj dCk 



A. (3.40) 



The factor \ in second order terms is introduced for later convenience. 

It is crucial that this expansion in partial derivatives terminates at second order 
for all terms otherwise the final partial differential equation for P will not be of the 



^Or, equally well, linear in an un-normalized 
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FPE form ()3.37|) . This usually creates a restriction on the complexity of processes in 
the model that can be considered. For example in the gauge P representation used 
in later parts of this thesis, one-particle and two-particle interactions are fine, but 
three-particle collisions cannot be treated. This restriction can often be surmounted 
if one increases the complexity of the kernel, at the cost of additional complexity in 
the equations. 

Having confirmed that the identities ()3.40j) hold with no 3rd or higher derivative 
terms, the FPE for P can be derived as follows. Firstly, note that only kernels 
that are not explicitly time-dependent are being considered here, although time- 
dependent kernels can also be treatable by a similar, but more voluminous approach. 
Having said this, the master equation expanded as ()3.38p can be written using ()3.40|) 
as 

dP{C) 



X 



Ti"Hc) + Y.Til\C)^y^Y.TSKc) 



We now integrate the right hand side by parts, which gives 
dP{C) 



_d d_ 

dCj dCk 



(3.41) 
A(C) dC. 



X 



^/^^)-E7|rT«(C) + i$: 



d d 



9Cr'^ ' 2^dCkdCj ^^^^ ' 



(3.42) 

P[C) dC 



3 ■' jk 

provided that boundary terms can be discarded. This last is unfortunately not always 
true, most typically when the distribution P{C) has power-law tails as the \Cj\ head 
to infinity. ChapterlHlis devoted to this issue and ways to ensure that these boundary 
terms are forced to zero using the stochastic gauge technique. Section 16.1.11 derives 
the exact form of the boundary terms (jfi.lj) . which must be zero for the derivation 
to succeed. Also, there have been developed numerical tests that allow one to check 
if nonzero boundary terms may be a problem |64j. 

Now, there may be many P{C) that satisfy ()3.42j) . but one certainly is 



dP{C) 



E 



j jk ■> 



d d 



P{C). (3.43) 
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This is almost an FPE, except for the possible terms constant in P. These typically 
appear when the normalization of p is not conserved. It turns out that in many cases 
the constant terms are zero, and one can directly obtain a Fokker-Planck equation 
of the form ()3.37|) . noting the correspondences for the drift vector and diffusion 
matrices 

MC) = J2^ifiC) (3.44a) 

DAC) = Y.^S1{C). (3.44b) 

I 

Nonzero constant terms can be easily treated using a gauge method described in 
Section Ol 

3.4.2 Fokker-Planck equation to stochastic equations 

An appropriate set of diffusive stochastic Langevin equations for the variables Cj 
is known to be equivalent to the FPE for the distribution P{{Cj}) in the limit of 
infinitely many samples of the variables Cj. This textbook topic is described in detail 
e.g. in Gardiner [SnilS21- An important qualification is that this correspondence only 
holds if the diffusion matrix Djk{C) when written for real variables Cj is positive 
semidefinite^ . 

If this is the case, then Langevin equations for the (real) Cj, equivalent to the 
FPE (lOTjl . are 

dCj{t) = Aj{C, t)dt + J2 BjkiC, t) dWk{t) (3.45) 

k 

in the Ito calculus. Here, the noise matrices Bjk are any matrices that satisfy the 
diffusion matrix decomposition 

D = BB^, (3.46) 

i.e. in terms of matrix elements 



Djk — BjiBki. 

I 

^i.e. has no eigenvalues with negative real part. 



(3.47) 
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The dWk{t) are real stochastic Wiener increments (i.e quantities satisfying the large 
sample (iS ^ oo) averages) 

{dWMstoc^ = (3.48a) 
(rfPF,(t)diyfc(t')),toch = S,kS{t-t')dt\ (3.48b) 

For the small but finite time steps At — ^ dt in a calculation, dWk{t) are usually 
implemented by real Gaussian noises^'' independent for each k and each time step, 
having mean zero, and variance At. Some more detail of the numerical implemen- 
tation of stochastic equations with Wiener increments can be found in Appendix iBl 
Lastly, it was chosen to write the equations immediately in difference form, as this 
is the form in which actual computer calculations are made. 

Now, clearly all the coefficients Aj and Bj^ must be real, since the equations 
are for real variables Cj. However, the decomposition ()3.46|1 in terms of real B is 
only possible if the symmetric diffusion matrix D{C) is positive semidefinite. This 
explains the reason for the positive semidefinite condition on D. 

Finally, strictly speaking, a more general set of stochastic equations that remains 
equivalent to the FPE ()3.37|) is possible, and will be of use in later parts of this 
thesis. Consider that only the means and variances ()3.48|) of the Wiener increments 
are specified. If one writes ()3.45p as 

dCj{t) = Aj{C, t) dt + dXj{C, t) (3.49) 

then by the properties of Ito stochastic calculus, the only binding relationships for 
the stochastic terms dXj are that 

(^^.(^))stoch = 0, (3.50a) 
(rfX,(t)rfXfc(t')),toch = {D,k),,,,Jit-t')dt'. (3.50b) 

Indeed, dXj{t) need not even be Markovian (i.e. independent of quantities at times 
t)j provided the conditions ()3.50|) are satisfied. The relationship between the 

"'^''Other noise distributions are also possible, simply provided the conditions (|3.48(l are satisfied. 
Since by the CLT the sum of several infinitesimal noises will always approach an infinitesimal 
Gaussian-distributed noise anyway, it is usually more efficient to start with a Gaussian noise right 
away, rather then make one the hard way by summing several non-Gaussian noises. 
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Fokker-Planck equation ()3.37|) and stochastic equations ()3.49|1 is considered in a 
more rigorous fashion and covered exhaustively in the texts by Gar diner [SHI 102], but 
equations ()3.45p through to ()3.50p will be sufficient for the considerations of this 
thesis. 



3.4.3 Ensuring positivity of diffusion for analytic kernels 

In the important case of a kernel that can be written as analytically dependent on 
complex parameters Zj (rather than on real CjY^, then we can use the freedom of 
interpretation of derivatives with respect to complex arguments to choose such a 
Fokker-Planck equation that the diffusion matrix D is always positive semidefinite, 
and Langevin stochastic equations ()3.45|) can always be derived. The procedure 
described below is simply an application of the same method used for the special 
case of a positive P distribution by Drummond and Gardiner 

This freedom of derivative choice arises from the property of all analytic functions 
f{zj) that 

dfjz,) dfjz,) dfjz,) 

dz, ~ dz'^ ~ ' dz'; • ^^-^^^ 

(Where real and imaginary parts of the variable have been denoted for brevity by 
the shorthand Zj = Zj + iz'J.) 

The derivation in Section 13.4.11 follows through formally without change if the 
real variables Cj are replaced with the complex zj, the set of parameters now being 
C = {zj}. Using ()3.51|) the terms in ffist order derivatives of the kernel in ()3.40|) 
can now be equated to 



I^e{T«(C)}A + Im{T«(C)^ ' 



dz; 



AiC). (3.52) 



To deal with the second order terms, one can decompose T^^l = Djk = BjpB^p 
into the noise matrix forms (which will in general be complex), use the shorthand 



11 



i.e. A depends on Zj but not on any z*. 
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B = B' + iB", and express them (all together) using ()3.5H1 as 

I p ■' 

p [^jp^kpdzr 8z'^ ^jp^kpdz'. 8z'l ^ ^iV^kpdz'l 8z'^ ^jp^kpdz'J a^'J ^H'-^J- 

(3.53) 

Let us see what has happened to the diffusion matrix for real variables, as it is 
this that must be positive semidefinite for Langevin equations ()3.45|1 to be obtained. 
While the indices j, k,p = 1, . . . , have referred to complex variables (of number 
Nz), let us also define real variables Vj> (with primed indices j', k',p' = 1, . . . , 2Nz) 
by Zj = Vj and = t>(j_|_Ar^). Denoting the diffusion and noise matrices of the real 
variables vj' as D(^) and B^^\ respectively, we have D^f^, = EjS^jv^I'y. and 
from dSm: 

5]$ = Re{i?,p}, (3.54a) 
B%^^^ = lm{B,,}, (3.54b) 
B^U = B^N..^N. = ^. (3.54c) 

which are all explicitly real. Thus, the diffusion matrix is explicitly positive semidef- 
inite, and Langevin equations for all the Vji can be obtained. 

From ()3.45j) . ()3.52|1 . ()3.44a|l . and ()3.54|1 the resulting stochastic equations for the 
real variables then are 

dvjit) = Re {AjiC t)} dt + Y^ Bj^liC, t) dWk{t) (3.55a) 

k 

dv.+^Xt) = ^^{MC.t)} dt + Y,Bfl^^^f,{C,t)dWu{t), (3.55b) 

k 

which can be written in shorthand form for the complex variables Zj as 

dzj{t) = Aj{C, t)dt + J2 Bjk{C, t) dWk{t). (3.56) 

k 

Note that the Wiener increments dW^ remain real. 

The only requirement (apart from obtaining an FPE) to carry out this procedure, 
which ensures a stochastic equation interpretation, is that the kernel be written as 
an analytic function of complex variables. 
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Finally, the procedure described in this section can be carried out on only a 
part of the total set of variables C, if the kernel can be made analytic in only this 
part. Provided the resulting combined noise matrix B^'"^ for all the real variables of 
the system (real and imaginary parts of the complex variables, and the remaining 
real variables) is real, the diffusion matrix is positive semidefinite, and Langevin 
equations for all the variables can still be obtained in the same way as here. 



3.4.4 Positive P example: dilute lattice Bose gas equations 

Continuing with the positive P representation example, let us obtain the stochastic 
equations corresponding to the quantum dynamics of the lattice interacting Bose 
gas Hamiltonian ()2.17|) with single-particle losses to a zero temperature heat bath. 
The master equation is ()2.20j) with Linblad operators ()2.2H) . and can be written in 
terms of p and the creation and destruction operators On, which can be identified 
with the Aj of Section 1!^. 4. II 

These can be easily verified (via (j3.9|) and ()3.26|) . and the mutual commutation 
of local kernels An) to obey the identities 



OnA 


= CtnA, 


a|jA 


= (/5n 




= /3nA, 




= f ttn 



d 



_d_ 



(3.57a) 

A, (3.57b) 

(3.57c) 

A (3.57d) 



for all lattice points n. It can be seen that to obtain an FPE there can be 
up to two operations of the kind a^p or pa per term in the master equation, and 
any number of ap or pa'' . This allows any one- or two-particle processes but rules 
out three-particle processes, which contain terms of third order in a or in the 
Hamiltonian. 
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The terms in the master equation can be sorted (according to processes) as 

Ti[p] = -i^ujnm[aia^p-pala^], (3.58a) 

nm 

Up] = -^xJ2[^n^nP-P^i'K], (3.58b) 

r 

n 

Which, using ()3.57|1 . gives 




(3.59a) 



r,|A| = _.,^(2„jA£-2a„^S|- + aS|^-^SgA,(3.59b) 

Proceeding in the same manner as in Section 13.4.31 (remember that the and 
Pn variables are complex), we obtain the (Ito) stochastic equations 

dan = —i ^^nmO^indt — 2ixa\l3ndt — "^andt + ian^/2ixdWa{t) , (3.60a) 

m 

dpn = ^5^cu^n/9m(it + 2^xan/5nC^^- y/5ncit + /3nv^f^W^n(t). (3.60b) 

m 

Here the 2N real Wiener increments dWn{t) and dWn{t) are all independent at 
each time step. We have used the "simplest" diagonal noise matrix decomposi- 
tion Bnm = -Bnn'^nm here. More on the possibilities with these decompositions in 
Chapter H 



-palar 



(3.58c) 



3.5 Convenience for parallel computation 

There is no cross-talk between separate realizations of the stochastic process in the 
simulation scheme described in this chapter. This is a highly desirable property 
for numerical realization because only one trajectory at a time need be stored in 
memory. In fact, many independent trajectories could be run at the same time on 
different computers in a cluster, or even on entirely separate computers, and then 
collected together at the end. The same applies to calculating observable averages 
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()3.14|1 , where all one needs is to keep track of the running sum of averaged quantities 
and their number^^. 

In combination with the linear scaling of variable number this makes for a po- 
tentially powerful and efficient combination for many-body simulations. 

An exception to this convenient scaling with trajectory number are non-observable 
quantities such as fidelity, estimated by methods like in Section TS.H.HI In that kind of 
calculation we have to keep in memory the variables for all s samples in a subensem- 
ble to use all pairs of them in the calculation ()3.22j) . Memory requirements increase 
by a factor of s to hold the entire subensemble. The actual calculation of estimates 
like ()3.22|) also takes a time oc as opposed to oc s for static observable estimations 
()3.14j) . Since for reasonable subensemble averages one typically needs (9(100) or 
more samples to a subensemble, this requires a increase of two orders of magnitude 
in computing resources — highly nontrivial. Fortunately, at least the linear scaling 
oc of the number of variables remains untouched. 



3.6 Summary of representation requirements 

This chapter has shown or strongly indicated that for a representation to be useful 
for quantum simulations of many-body systems, it must satisfy the requirements 
listed below: 

1. Non-negative real distribution: To interpret P{C) as a probability distri- 
bution over kernel operators A(C), it must be real, normalizable to unity, and 
non-negative. 

2. Non-singular distribution: Needed for finite probabilities and un-biased 
dynamic sampling, although delta functions in the initial state can be toler- 
ated. 

3. Complete: The representation must be able to describe the initial states, 

and any states reachable by the subsequent evolution, while satisfying the 

"'^^And possibly the sum of the squares of the averaged properties for assessment of uncertainty 
in the estimate as per (|3.18f) . 
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above conditions. In practice this usually implies that the kernel must form 
a complete or overcomplete basis, and that the kernel be off-diagonal. See 
Section 13.2.31 

4. Linear scaling: The number of system variables in C must scale linearly 
with the system size (Typically number of modes, particles, orbitals,. . . ). 
In practice this requires the kernel to be a tensor product over local kernels 
Afc for each subsystem, or a linear combination of a small (<^ N) number of 
such tensor product terms. See Section 13.2.21 

5. Most samples significant: The majority of samples (i.e. O (S)) should 
contribute significantly to all observable estimates of interest, particularly ob- 
servables local to the subsystems. This appears to require the use of an over- 
complete basis, and kernels normalized locally (i.e. Tr =1), perhaps 
apart from some global multiplying functions. These issues are discussed in 
Section 13.3.41 

6. Finite kernel trace: All configurations C must lead to a finite kernel trace, 
otherwise observable averages will be undefined for 5 ^ oo and biased for 
finite S due to divergence of the denominator in ()3.14j) . 

If the stochastic interpretation of the quantum evolution is obtained via a Fokker- 
Planck equation as in Section 13.41 (presently the only approach that has led to 
nontrivial simulations), then the following requirements must also be satisfied: 

7. Differential equation: The configuration parameters C should be continu- 
ous to allow a differential equation for P{C). 

8. Fokker Planck equation: The mapping ^ must generate only first 
and second order partial derivatives of P{C). See Section 15.4. 11 

9. Positive propagator: The short time diffusion in phase space must be non- 
negative (i.e. the diffusion matrix for real variables D^""^ has no eigenvalues 
with negative real part), to allow a stochastic interpretation. This can be 
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guaranteed if the kernel is an analytic function of complex variables (see Sec- 
tion EI^SI), although in some situations non-analytic kernels may also lead to 
satisfactory diffusion. 

Furthermore, there are several additional requirements that appear generically in 
simulations, and must be satisfied to avoid bias and/or unmanageable noise in the 
observable estimates. 

10. Stable trajectories: If trajectories in phase space diverge rapidly, so does 
the width of the distribution P{C), and all useful precision in the estimates 
of observables is lost. 

11. Unbiased: Pathological cases are common (particularly in nonlinear under- 
damped systems), where the distribution P{C) is too broad to allow unbiased 
sampling of some or all observable moments — typically when P{C) devel- 
ops power-law tails. These "boundary term" errors are considered in detail in 
Chapter and usually go in tandem with unstable trajectories. 

12. UV convergent: Numerical simulations almost always require one to approx- 
imate space by a finite lattice. This does not change the physical predictions of 
the model provided the lattice spacing is much finer than any other significant 
length scale in the model. This, in turn, requires that the effect of vacuum fluc- 
tuations on the equations of motion and observable estimates abates at large 
momentum (i.e. at small length scales). A mode-based lattice calculation will 
typically be non-UV-convergent if there are noise terms in the equations that 
do not disappear at zero mode occupation. 
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Chapter 4 
Stochastic gauges 

The non-uniqueness of representations P{C) leads to many potential sets of stochas- 
tic equations representing the same single master equation. In this chapter, the 
stochastic gauge formalism is introduced, which systematically characterizes the 
freedoms available in the stochastic equations after the physical model has been 
completely specified. The term "stochastic gauges" is used here because of an anal- 
ogy with electromagnetic gauges: In both cases the equations contain arbitrary (in 
principle) functions, which have no effect on physical observables, but can have a 
great influence on the ease with which the calculation proceeds. In the case of 
stochastic gauges, the choice of gauge has no effect^ on observable quantum expec- 
tation values ()3.14j) calculated in the limit of an infinite number of samples. For 
a finite number of samples, however, different gauges affect how rapidly this large 
sample limit is approached. It will be shown that an appropriate gauge choice can 
improve simulation efficiency by many orders of magnitude, and also correct biases 
caused by pathological distributions. 

During the derivation of stochastic Langevin equations ()3.45|) for the variables 
Cj from a master equation for the density matrix, there are two distinct places where 
stochastic gauges can be introduced. Firstly, null differential identities on the kernel 
A, multiplied by arbitrary functions and integrated over P{C) can be introduced 
into the master equation at the point ()3.4H1 . which leads to JF finding its way into the 

^In principle. In practice there is also the issue of boundary term errors, which may be present 
or not, depending on the gauge chosen. This is discussed in detail in Chapter El 
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FPE, and into the final stochastic equations. These "kernel stochastic gauges" are 
described in detail in Sections 14.11 to 14.31 Secondly, the diffusion matrix of an FPE 
does not uniquely specify the noise behavior of the resulting stochastic equations. 
These degrees of freedom can also be used to introduce arbitrary functions g into 
the noise matrices B, as described in detail in Section |33] A partly restricted set of 
kernel and diffusion gauges, which will be here dubbed the "standard" gauges, and 
are sufficient for most purposes, are summarized in Section 1^31 

As in the previous Chapter El the derivation is kept as general as possible 
with the aim of sifting out those elements that are truly necessary for stochastic 
gauges to be present and to be useful. Several authors have recently (all starting 
in 2001) proposed phase-space distribution methods that make use of the freedoms 
that are here dubbed stochastic gauges. This includes the "noise optimization" of 
Plimak, Olsen, and Collettj^, the "stochastic wavefunction" method of Carusotto, 
Castin, and DalibardPinn], and the "stochastic gauges" of Deuar, Drummond, and 
Kheruntsvan[3t l66 t 15 6 | IHT] . All these are unified within the formalism presented in 
this chapter, which is itself a generalization of some ideas to be found in the articles 
by Deuar and Drummond [3| and 



4.1 Generalized kernel stochastic gauges 



Whenever there are null differential identities on the kernel of the general form 

A{C) = j\a{C)] =0,(4.1) 



dCdCk 



3 •' jk 

a stochastic gauge can be introduced in the following manner: 

Since (14.111 is zero, so is its integral when multiplied by non-divergent functions. 
In particular. 



dC = 



(4.2) 



with the function JF(C, t) otherwise unspecified. Now one adds zero (in the form 
()4.2|) ) to the right hand side of the master equation ()3.41|) in the derivation of the 
Fokker-Planck equation of Section 13.4.11 (Actually, to be fully rigorous, one must 
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make sure that the full integral / -P(-)A over the entire master equation ()3.41|1 plus 
all "zero" terms ()4.2|1 is convergent, otherwise boundary term errors may result. 
This issue is considered in detail in Chapter so for now let us defer this and 
assume that no boundary term errors are present.) 

Adding ()4.2j) introduces an additional master equation term that can be written 
in the notation of Section IH.4.11 as 



J 



A 



A 



(4.3) 



Following now the procedure of Sections 13.4.11 and ()3.4.2|) . the result is that the 
coefficients in the Langevin equations ()3.45j) become modified to 

MC)^ Af\c)= A,{C)+nC)jf\C) (4.4a) 
DAC)^ D%\C)= D,,{C)+:F{C)jf^{C). (4.4b) 

In this manner an arbitrary function JF has been introduced into the stochastic 
equations. The number of these is unlimited provided we have at least one identity 
of the form ()4.1|) . although there will be at most as many gauges with distinct effects 
on the equations as we have distinct identities. 

The whole procedure is based on properties ()4.1|) of the kernel, and so the kernel 
gauges that can be used in a particular simulation will depend on the local subsystem 
bases used to expand the density matrix, and on how the kernel is constructed out 
of these. 

Some (though certainly not all) broad classes of gauges that are possible with 
various kernels include: 

• Analytic: The procedure used in Section 13.4.31 to ensure a positive diffusive 
propagator for representations based on kernels analytic in complex variables 
Zj can be interpreted as using a gauge of the form 

^ • ^ ^A(;2,) = 0. (4.5) 



dKe{zj} dlm{zj} 
This applies for any kernel analytic in Zj. 

Weighting: If constant terms are present during the derivation of an FPE 
as in ()3.43|) in Section [3.4. 11 they can be converted to deterministic evolution 
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of global weights, provided the kernel contains a global factor using an 
identity of the form 



See Section 1121 below. 

• Drift: One can modify the deterministic part of the stochastic equations 
()3.45p in principle at will, provided compensation is made in the form of ap- 
propriate global trajectory weights. Building on the weighting gauge identity 
()4.6|1 for kernels with global weights Q, the identities 



follow for any variable Cj, and can be used for this purpose. See Section 11731 

• Reduction: For some kernels, identities can be found that reduce second 
order partial derivatives of the kernel to first order. These can be used at the 
level of expressions ()3.4()j) for each part 7/ of the master equation to reduce 
partial derivative terms obtained with the operator identities ()3.39p to lower 
order. This will result in a change from diffusion behavior in the FPE to 
deterministic drift, or even better, in transformation of third- or higher-order 
partial derivative terms (which prevent one from obtaining any FPE) to lower- 
order terms, which can form part of an FPE. The identities required are of the 
general form 



for some variables Cj, Ck, and Cp, and some function f{C). An example can 
be found in the recent work of Drummond and Corney on boson and fermion 
phase-space distributions jHH EH] 

4.2 Weighting stochastic gauges 

Consider the situation where during the derivation of the FPE in Section r3.4.1l one 
encounters nonzero constant terms Most typically, this situation 




(4.6) 




(4.7) 




(4.8) 
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occurs with un-normalized density matrices e.g. in the thermodynamic equation 
()2.29j) . As it stands, such an equation of the form ()3.43j) does not lead immediately 
to stochastic equations for the variables C. However, if the kernel contains a global 
weight factor Q, or by a simple modification is given one, this constant term can be 
converted to a deterministic evolution of this weight. 
What is needed is a kernel of the form 

A{C) = QA{C). (4.9) 

where the "base" kernel A (£7), containing all the local basis operators for subsys- 
tems does not depend on the global weight Q. For notational convenience let us 
define the logarithm of the weight by 

n = e^'\ (4.10) 

then the full parameter set is C = {Cq, C_}. Directly from ()4.9p . one can see that 
the gauge identity 

applies. If, now, one adds "zero" defined as 

J2 1 P{C)Tf'\C) - l) A{C)dC = (4.12) 

to the master equation in the form 1)3.411) . the original terms in t/^"* vanish, to be 
replaced by first order differential terms 

J2 J P{C)Tl'\C)^A{C)dC. (4.13) 

This can be incorporated into the formalism of the derivation in Section 13.4.11 by 
including labels j, k = in the sums J2k^ noting that now 

T«(C) = tI'\C) (4.14a) 

tSI = T[^=Tto^ = 0- (4- 14b) 

This then follows through to additional drift and diffusion coefficients ()3.44|) in the 

FPE (pn?7|i 

Ao{C) = J]r/°)(C:) (4.15) 

I 

D,o = Do,=Doo = 0. (4.16) 
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The new equation for Cq = logQ. becomes 



dCo{t) = Ao{C,t)dt 



(4.17) 



(with no noise), while the stochastic equations for all the remaining variables Cj 
remain unchanged. 

Lastly, the global weight can be made complex with no formal change to the 
above, if one needs to deal with complex constant terms t/^''. 

4.3 Drift stochastic gauges 
4.3.1 Mechanism 

Consider again a kernel with global weight Q = 6*"°, defined as ()4.9j) in the pre- 
vious section. The weighting gauge identity ()4.11|) implies further gauge identities 
involving each of the variables in C (including Cq itself): 



While ()4.11|) was used to convert constant terms in the "FPE-like" expression ()3.43|) 
to first order derivative terms (and hence deterministic evolution in the weight), 
the new identities can be used to convert first to second order terms. This allows 
conversion of deterministic evolution in a variable Cj to stochastic changes in the 
weight. Such a conversion can be highly desirable as will be shown in Chapters IHl 
and later. 

The procedure is similar to that employed in Section 14. 2^ however some modi- 
fications are made to make the final expression for the stochastic equations in Cj 
more revealing. If one adds "zero" , defined this time as 








(4.18b) 



(4.18a) 



J, A(C) 



(4.18c) 




(4.19) 
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with arbitrary gauges J^j for each variable Cj, then in the formahsm of Section 
additional terms J P{C)Tq A(C) dC = are introduced into the master equation, 
with the superoperator % containing coefficients 



-T,{C) 



= 2^o(C), 



(4.20a) 
(4.20b) 
(4.20c) 



where underlined indices have been used to indicate labels for the "base" variables 
such that j = 1,2,..., etc. Following the procedure through, the modified FPE 
coefficients 



A, 



00 



D_j_k 
2^0 



Do, 



(4.21a) 
(4.21b) 
(4.21c) 
(4.21d) 



are obtained (the original coefficients when Tj = are the underlined). The modi- 
fications to the deterministic evolution of variables are seen to be —J-'j dt. 

A difficulty with the forms 1)4.211) is that in general the form of the diffusion 
matrix elements Bjk that actually go into the stochastic equations will depend in 
some complicated manner on all the J^j. If the aim is to modify only the deterministic 
(drift) evolution of the variables C j (as will be the case in later chapters), one may 
end up with the side-effect of additional changes in the noises BjkdWk- This can 
prevent a clear assessment of what practical effect the change in the equations due 
a particular gauge will have. 

A convenient form to have the noise matrices B in would be to have no change in 
the base variables Cj apart from the modification of the drift, and define new (also 
arbitrary) gauge functions Qj (JF) that give the exact stochastic terms for the weight 
variable Cq, and do not introduce any more independent noises than were had in 
the original formulation. That is, if the prior weight evolution had no diffusion, one 
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can try the ansatz 

Bik = B^k (4.22a) 
Bok = Gk. (4.22b) 

Using D = BB^ , the diffusion matrix elements that would result from ()4.22|) are 

^ik = ''^RipRkp = Hik (4.23a) 

p 

p 
p 

Comparing these with ()4.2H) . one identifies 

^0 = \Y.^\ (4.24a) 



Note that there has to be a small restriction of the gauge freedoms from ()4.21|) to 
()4.24p (one less arbitrary function due to ()4.24aj) ) to achieve the convenient noise 
matrix B given by the ansatz ()4.22|) . 

The final stochastic Langevin equations with gauges Qk then are (noting that 
B_j_k = Bj^k) 

dCj_ = A^dt + ^B^kidWk-Gkdt), (4.25a) 
k 

dCo = Aodt + J^Gk (dWk -\Gk dt\ . (4.25b) 

The (Ito) equation for the actual weight VL = e'^° takes on a simpler form 

ci(] = + ^^fcdW^fcj . (4.26) 

with no new drift terms. Recapping, several assumptions beyond the identities 
()4.18p have been made to obtain ()4.25p . and these are: 
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1. The integral / P{-)AdC for the master equation in form ()3.4Hl . including 
terms containing the J^j, is convergent. 

2. The function JFq is determined by ()4.24a|) . rather than being an independent 
arbitrary function. 

3. The prior evolution of the weight contains no stochastic terms {Djq = Dqq = 0). 

Lastly, as with weighting gauges, if one is dealing with a kernel analytic in 
constant variables zj, the above derivation goes through for Zj instead of Cj with no 
formal change, provided the weight variable Cq = Zq = logQ is now also complex. 
The gauges Qk would then be arbitrary comjo/ea; functions. 

4.3.2 Real drift gauges, and their conceptual basis 

If one arbitrarily adjusts the drift behavior of the stochastic equations, there must be 
some kind of compensation to ensure continued correspondence to the original quan- 
tum master equation. When the gauges Qj are real, the modification of the weight 
can be understood in terms of compensation for the increments B_jj:{dWk — Qk) 
no longer being described a Gaussian noise of mean zero (in the many infinitesi- 
mal timesteps limit when the CLT applies). Let us first look at the simple case of 
one subsystem with one complex variable z, with the gauge- less weight is constant, 
and the drift gauge Q real. The equations for the real and imaginary parts of the 
variables z = z' + iz" and zq = log fl = z'q + izq are 

dz' = A'dt+ B'dW - B'gdt (4.27a) 

dz" = A"dt+ B"dW - B"gdt (4.27b) 

dz'o = -^g^dt + gdW (4.27c) 

dzl^ = 0, (4.27d) 

where we have used the shorthand B_' = Re {5} and B_" = lm{B_} etc. for all 
quantities, and omitted writing subscripts j,k = 1. 

Consider a very small time step dt in which the variables change from z{t) to 
z{t + dt) etc. This time step is small enough so that the stochastic equations for 
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infinitesimal dt apply, while large enough that the Wiener increment dW is Gaus- 
sian distributed due to the CLT. Suppose also that we consider just one particular 
trajectory so that the initial distributions of z and Zq are delta functions around 
their initial values. The probability distribution of the Gaussian noise is 



Pr{dW) = e 



-dW^/2dt 



(4.28) 



and if there is no gauge {Q = 0), the dependence of z'{t + dt) on the realization of 
the noise dW can be inverted to give 

z'{t + dt) - z'{t) - A'dt 



dW 



B' 



(4.29) 



Substituting this into ()4.28|) . we obtain an expression for the probability distribu- 
tion^ of z(t + dt): 

[z'{t + dt)- z'{t) - A'dt)^' 



P{z{t + dt)) = exp 



(4.30) 



2{B')^dt 

If we now arbitrarily modify the drift with a nonzero gauge, ()4.28j) still applies, but 
the inverted relation dW{z'{t + dt)) now becomes 

z'{t + dt) - z'{t) - A'dt + QB'dt 



dW 



B' 



(4.31) 



leading to a different probability distribution Pg{y{t + dt)) than the "correct" gauge 
less ()4.30p . Precisely: 

{z'it + dt) - z'{t) - A'dt + QB'dt)^ 



Pg{z{t + dt)) = exp 



P{z{t + dt))exp 



~ [gB'dt + 2{z'{t + dt) - z'{t) - A'dt)} 
IB 

(4.33) 



However, we can recover from this by introducing a global compensating weight f2 
for the trajectory, which is always applied in all observable averages so that 



VtPg{z{t + dt)) = P{z{t + dt)). 



(4.34) 



^Note that we use only expressions involving z' to characterize the probability distribution of 
the entire complex variable z. This is because z' and z" are completely correlated with each other, 
since only the one real noise dW determines them both. The imaginary part, z" , could have been 
used just as well. 
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Using ()4.3H) '^ we can write this compensating weight from ()4.33|) as 



n 



exp 



g I ~-gdt + dw 



(4.35) 



Identifying Q from the definition of the kernel ()4.9j) one sees that the drift gauge 
equations for dzo ()4.27c|l and (j4.27d|l produce exactly the right global weight to 
compensate for the arbitrary drift corrections introduced by nonzero g. 

For the many-variable case with base variables Zj plus the global complex 
log- weight zq, the gauged stochastic equations ()4.25|1 can be written in vector form 
as 



dz = Adt+ BdW - BQdt 



dzt) 



1 



-i-Q'^Qdt + G'^dW, 



(4.36a) 
(4.36b) 



where all bold quantities denote column vectors of A^^ elements labeled by j 
1, . . . , A'x etc. The probability distribution of a noise realization is 

dW^dW\ 



Pr((iVF) = exp 



(4.37) 



Idt ) 

since all noise elements dW^ are independent. With no gauge, dW = {B_')~^ [dz' — A'dt], 
leading to the probability distribution 

1 



P{z{t + dt)) = exp 



2dt 



{dz' - A'dty [{B'yYiB')-^ {dz' - A'dt) 



(4.38) 



while with a gauge the noise can be written 



dW = [B')-^ [dz' - A'dt + B'gdt] 



(4.39) 



This leads to a new probability distribution 

{dz' - A'dt + B'gdtf [{B')-Y{B')-^ {dz' - A'dt + B'gdt) 



Pg{z{t + dt)) = exp 



2dt 



Pg{z{t + dt)) 

g^{B')-\dz' - A'dt) + {dz' - A!dt)^[{B:)-^Yg + g'^g^ 



X exp 



Pg{z{t + dt)) exp 



-g^dw + ]^g^gdt 



(4.40) 



^Not 



since that applied only for Q — Q. 
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Where the last hne was arrived at using ()4.39|1 . This requires a compensating weight 



which is given by the gauged equations if = e °. 
As corollaries of this: 

1. If there was no noise in the original equations {B_ = 0), or the noise matrix 
was singular, then there would be no way a global weight could compensate for 
an arbitrary change in the drift equations — the new drift would just be plain 
wrong. For the case of very small noise in the z equations, even small changes 
with respect to the original drift A require very large weight compensation. 
These are the reasons why the gauges Q are multiplied by the coefficients 5 
in the dz equations, but not in the weight equation for dzQ. Thus 

If there is no random component to the evolution of a variable z, its 
drift cannot be modified using a drift gauge. If the noise matrix is 
singular, no variable drift can be modified using drift gauges. 

2. While the weight equation ()4.26j) appears disordered and random due to the 
presence of noises dWk, the evolution of Q is actually strictly tied to the random 
walk of the other variables zj, and acts to exactly compensate for any gauge 
modifications of the original drift. 

3. If we consider the (2A^2-dimensional) phase space of all base variables in C_ = 
{zj}, then a single real drift gauge Qk changes the drift only in the phase-space 
direction specified by the kth column B^k of the noise matrix^, associated with 
the kth noise dWk- 

4.3.3 Complex gauges 

The change in the drift due to complex gauges G = Q' + iQ" is always 




(4.41) 



A 



Adt - BQ'dt - iBg"dt. 



(4.42) 
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Table 4.1: Tally of drift gauge freedoms and comparison to phase space size. 





Number 


Kind 


Size of base phase-space C_ = {zj } (degrees of freedom) 




real 


Size of full phase-space C = {zq, C_} 


2N, + 2 


real 


Number of standard drift gauges Qk 




complex 


Number of noises dW^ in standard drift gauge scheme 




real 


Number of broadening drift gauges Qj 




complex 


Number of noises in broadening drift gauge scheme ()4.84j) 


3iV, 


real 



Imaginary gauges Q'^, in contrast to real ones, lead to changes in the drift only in a 
particular direction iB_k orthogonal to that affected by a real gauge. Note however, 
that this is a particular direction, and not any of the infinitely many orthogonal to 
B k- This kind of drift modification orthogonal to noise direction is not compensated 
for by changing the weight of the trajectory but by modification of its phase 
e*^o', leading to an interference effect between trajectories. It appears harder to grasp 
intuitively that this fully compensates, but in simulations such gauges are also seen 
to preserve observable averages (see Part B). 

Similarly to purely real gauges, no drift modification can be made if the noise 
matrix B_ is singular, as seen from ()4.42|) . Conversely, with non-singular 5 one 
has, in theory, full freedom to modify the deterministic evolution of the complex 
Zj to any arbitrary functional form. The weight Q will keep exactly track of these 
modifications by virtue of using the same noises dW. In fact, the weight evolution 
can be written as a deterministic function of the evolution of the remaining variables: 

dzo = g^ !^B-^{dz- Adt) + ^gdtY (4.43) 
A summary of drift gauge freedoms can be found in Table 14.11 

4.3.4 Weight spread 

While the global weight e^" is completely determined by the evolution of the base 
variables Zj, in observable calculations ()3.14|) it appears (particularly in the denom- 
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inator) as effectively a random variable. Other things being equal, it is certainly 
desirable to make the spread of the weights small during a simulation. 



Variances to consider 

In general there are several properties of the weights that may be of relevance. 
Re {Q} appears directly in the denominator average of observable estimates ()3.14|) 



via Tr 



A 



oc Q. On the other hand, for general observables, both the phase and 
magnitude of the weight may be important to calculate the numerator average in 
an obsrvable estimate ()3.14|1 — the phase may be correlated in important ways with 
the other variables in the average. 

For some observables and kernels, the numerator expression of ()3.14j) depends 
mostly only on Rejfi}. This is particularly so at short times when starting from 
a state well described as a classical (and separable) mixture of the kernels. In this 
case, both the numerator and denominator of the observable estimator ()3.14j) depend 
mostly on Re {fi}, and the variance of Re {Q} is the most relevant to consider. In 
more general cases, the variance of will be more relevant. 



Log-weight spread estimate 

A common hindrance when choosing the gauge is the difficulty of accurately as- 
sessing the size of the weight spread that will be produced by a given gauge. Here 
approximate expressions for the variance of Zq at small times ()4.47|) will be derived. 

Assuming there is no non-gauge weight drift {Aq = 0), the evolution in the 
variance of z'q = Re {^o} is given by 

dvar [4] = rf((4)'>,toch-2(4)stoch^(4)stoch- (4.44) 
Using properties of the Ito calculus, one can evaluate the time increments of these 
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quantities from the stochastic equation ()4.36bjl as 

k 

^((4)'>stoch = (24Re{Ao}),„,,rft + 5^((i?;,)^diy,>^^^^^ (4.45b) 

k 

k 

and so in terms of covariances 

c^var [4] = J2 {(Q'kf + covar [{Gl)\ z',] - covar [{G',)\ z',] >^^^^, . (4.46a) 

k 

Similarly one finds 

dvar [<] = ^ ((^D' - 2 covar z'^])^^^^^ . (4.46b) 

fc 

A practical approximation to the variance at short times can be gained by assum- 
ing lack of correlations between zo and Q (i.e. that the covariances are negligible). 
These approximations then would be 

var [4(t)] ^ V^{t) = var [^(0)] + f (^U^')'>3toch dt' (4.47a) 

-^0 k 

varK(t)]^y;(t) = vaT[z'^{0)]+ fj2{Skitr),,^^^dt'. (4.47b) 

•^0 k 

Of course this is never exact because Zq does depend on the gauges due to its evo- 
lution, however for small times, equal starting weights (var [z'q] = var [zq] = 0), and 
autonomous gauges (i.e. Q does not depend explicitly on zq) the expressions ()4.47j] 
are good approximations. They can be used to assess the amount of statistical noise 
that will be introduced by a particular gauge. 

Small time regime for log-weights 

How small is "small time" in this context? Clearly times when the covariances are 
much less than the Ylf^iGkY terms. From ()4.46|) . this will occur for autonomous 
gauges and equal starting weights if 

4it)J2\^k\' (4-48a) 

k k 
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and 



(4.48b) 



To obtain limits on these, one needs an approximation to zq. Since dWk ~ O dt 
then dW is the dominant term in the evolution of zo at short enough times. This 
implies 



(4.49) 
(4.50) 



The root of sum of squares is due to independence of noises dWk- For the approxi- 
mations to z'q to apply one needs 



dt 



ldtJ2m' » |Re{Ao}rft| = ^ 



k { k ) { k ) 



dt < 



^Eki& 
{Ek\Sk\'} 



2 • 



(4.51) 



Given this is the case, z'^ is given by ()4.49|) . and so the condition ()4.48ap for ()4.47ap 
to hold becomes 



dt < 



{EklOkl'} 



2 • 



(4.52) 



Since this agrees with the initial assumption of noise term dominance ()4.5ip . it is 
the short time condition under which ()4.47ap is accurate. 
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For dzQ to be given by ()4.49|1 . one needs 



IdtJ^i^k? > \lra{Ao} dt\=dt 



j:^kQ. 



k K k ) j k 



dt < 



EkiGkY 



The condition for ()4.47bjl is then 



k \^ k k 

Y.^Qif » ^dtlY.^',g'A <AdtY,{<3-fY.^'3i) 

k \ k ) j k 



dt < 



^Ekis'.r 



(4.53) 



(4.54) 



Again, this agrees with the initial assumption of noise term dominance ()4.53|) . and 
is the short time condition under which ()4.47b|) is accurate. 



Log-weight variance limits 

As will be discussed in Section 17.41 and Appendix the variance of Zq should be 
< O (10) for the simulation to give results with any useful precision. Basically when 
z'q of a variance O (10) or more, the high positive z'q tail of the distribution contains 
few samples, but much weight, leading to possible bias. Also if z^ has a standard 
deviation of O (tt) or more, mutual cancelling of trajectories with opposite phases 
will dominate the average, concealing any average in noise for reasonable sample 
sizes 0{< 10^). 

This imposes a limit on how long a gauged simulation can last, and so it is 
extremely desirable to keep the magnitude of the gauges as small as possible. 

As a corollary to this point, it is desirable to ensure that all drift gauges are zero 
at any attractors in phase space, to avoid accumulating unnecessary randomness in 
the weights when no significant evolution is occuring. 
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Direct weight spread estimate 



One can also investigate the evoultion of the variance of the weight Q directly, and 
analytic estimates can be obtained in some situations. To better understand the 
effect of real or imaginary gauges on the weight, let us consider the case where 
gauges Qk and weight fl are decorrelated, there is no base weight drift {Aq = 0), 
and n = 1 initially for all trajectories. Let i7 = + iQ", then from ()4.26|) 



(4.55a) 
(4.55b) 



If we consider the evolution of the second order moments of the weights, then under 
the uncorrelated Q and Q assumption, it can be written as a closed system of 
equations 



d 



where 







[n'f 


> = 








stoch 




Ci(t) 




C2{t) 







Cl 


C2 


-2C3 






C2 


Cl 


2C3 


( 




C3 


-C3 


Cl - C2 







dt. 



stoch 



(4.56) 



' stoch 



' stoch 



stoch 



(4.57a) 
(4.57b) 
(4.57c) 



This system can be solved, and remembering that (f^(t))gtoch ~ ^ (from ()4.55j) ). and 
Q{0) = 1, one obtains the solutions 



var [fl'] 
var [Q"] 

(^'^")stoch 



ie/oCi(t')rfi' 



2 
1 

2' 



Jlc.{t')dt' ^ ^-ilc,it')dt' COS ( 2 / C3(t') dt 



e/o* dt' _ /* c,(i') di' COS ( 2 / C3(t') dt 



ieAn^i(*')-2(t')W<'sin(2 / c^(i!)dt 



- 1 

- 1 
(4.58) 



If the gauge averages Cj(t) are approximately constant with time, then at short 
times, when the condition that fl and are uncorrelated holds, the variance of the 
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weight is 



var m = t E ((^^)'>stoch + I E <(^^ - ^"^)'>stoch + ^ it') ■ (4-59) 



2 

k k 



In similar manner, one can also obtain 



var[|f2|] = exp 



1. (4.60) 



Some more conslusions about drift gauge forms 



For most observables it is the variance of Q' that is most relevant for uncertainties 
in the finite sample estimates, because it appears both in the numerator and de- 
nominator of ()H.14j) . However, for some like the local quadratures {a\ — ak)/2i the 
modulus of the weight is more relevant in the numerator of the observable expres- 
sion ()3.14|) . One can see from ()4.59p that at short times imaginary gauges lead to 
smaller real weight spreads because the variance grows only quadratically, most of 
the gauge noise going into Q". This makes imaginary drift gauges more convenient 
for short time estimates of most observables 

4.4 Diffusion stochastic gauges 

The Fokker-Planck equation specifies directly only the diffusion matrix D{C), which 
is then decomposed via 

D = BB^ (4.61) 

into noise matrices B, however these are not specified completely. This freedom 
in the choice of B, leads to a different kind of stochastic gauge than considered in 
the previous sections, which will be termed "diffusion gauges" here. No weights are 
required, and only the noise terms are modified. As with kernel stochastic gauges, 
the diffusion gauges may in some cases be used to choose a set of equations with 
the most convenient stochastic properties. The non-uniqueness of B has always 
been known, but has usually been considered to simply relabel the noises without 
any useful consequences. It has, however, been recently shown by Plimak et a/j2] 
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that using a different B than the obvious "square root" form B = Bq = D^^^ leads 
to impressive improvement in the efficiency of positive P simulations of the Kerr 
oscillator in quantum optics. (This has a similar form of the Hamiltonian to the 
nonlinear term in ()2.17p .) This somewhat surprising result leads us to try to quantify 
the amount of freedom of choice available in the noise matrices In the process, it 
will also become apparent why some extra properties of the kernel beyond the most 
general case are needed for diffusion gauges to be useful for simulations. 

4.4.1 Noise matrix freedoms with general and complex an- 
alytic kernels 

At first glance there appears to be a great deal of freedom in the choice of noise 
matrix B. Consider that the relationship D = BB^ for real variables can be 
satisfied by a iV„ x Nw noise matrix, subject to the Ny{Ny + l)/2 real constraints 
Djk = J2p^jp^kp- The number (Nw) of independent real Wiener increments is 
formally unconstrained. It turns out, however, that much (if not all) of this freedom 
is simply freedom of labeling and splitting up a single Wiener increment into formally 
separate parts having no new statistical properties. More on this in Section [4.4.31 

Recahng Section the Ito Langevin equations (gSSI), or (jOfill 

can also be written (Whether the Cj are real or complex) as 

dCj{t) = Aj{C, t)dt + dXj{C, t), (4.62) 

where the Bj^dW^ terms have been coalesced into a single stochastic increment 
dXj. Only the means and variances of the Wiener increments dWk are specified, 
and so by the properties of Ito stochastic calculus, the only binding relationships for 
the stochastic terms dXj are — mutual variances specified by the stochastic 

average of the diffusion matrix, and zero means. 

Let us ask the question "do the relationships completely specify the sta- 

tistical properties of the dXjT\ 

Consider firstly that because all the increments are taken to be "practically" 
infinitesimal in any simulation, then over any significant timescale the noise due 
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to the Wiener increments dWk will be effectively Gaussian, whatever the actual 
distribution of dWk used^. This is due to the central limit theorem. Also, ()3.5()aj) is 
guaranteed by the independence of the Bj^. and dWk in the Ito calculus, combined 
with ('^W^A:(i))stoch ~ 0- These constraints then imply that to check if there is any 
freedom in the statistical properties of the dXj it suffices to compare the number of 
covariance conditions (j3.50bj) with the number of covariance relations for the dXj. 

Seemingly this is trivial: for real variables Cj, there are Ny{Ny + l)/2 condi- 
tions ()3.50bj) . and the same number of possible covariance relationships {dXj dX^)^^^^^ 
This means that 

In the general case of a kernel with no additional symmetry properties, 
all the (formally different) possible choices of the noise matrix B have 
exactly the same effect on the statistical behavior of the Langevin equa- 
tions. 

This is despite the noise matrix having formally many free parameters. These then 
are largely freedoms to relabel and split up the Wiener increments without intro- 
ducing any new statistical behavior, as will be discussed in Section lOl 

This simple comparison of conditions and relationships also explains why trying 
various forms of the noise matrices B was for a long time thought not to have any 
useful effect — under general conditions it doesn't. 

It has been found, however, that a positive P simulation can be optimized by 
particular choices of i?j2]. Let us consider the case when the kernel A can be 
written as an analytic function of A^j, complex variables C = {zj}. Now, there 
are Nz{Nz + l)/2 complex conditions ()3.50bj) . As for statistical properties of the 
dXj, we have to consider independently both the real and imaginary parts, dXj and 
dXj respectively. Their covariance relations are (rfXj (iX^)^^^^^^, (^dX" dXj^'^^^^^^, 




real conditions per one complex, this means that there are left over degrees of 
freedom in the covariance relations between elements of the complex quantities dXj. 
These might be used to tailor these X| free variances or covariances to our needs. 
^Provided that the variance of dWk is finite. 




and {dXj dXj!'^ numbering Nz{2Nz + 1) real relations in all. Since there are two 
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This begs the obvious question of "why do the analytic complex variable kernels 
seem allow more freedom than a general real variable kernel", since they can also 
be written in terms of real variables - we just split each complex variable into real 
and imaginary parts. 

The key lies in the fact that kernels analytic in complex variables have special 
symmetry properties that give us freedom to choose the complex derivatives of the 
kernel as ()3.51|) . or indeed as 

dA{C) ^ dAjC) dAjC) 

using an analytic kernel stochastic gauge J-". This "analytic" symmetry then enters 
into the Fokker-Planck equation, and finds its way into the Langevin equations as 
freedom of choice of noise increments dXj. As with all stochastic gauges, another 
way of looking at this is that because the kernel has certain symmetry properties, 
there is a whole family of distributions P{C) of such kernels corresponding to the 
same quantum density matrix p. Choosing gauges chooses between these different 
distributions. 

In the following subsections, several specific types of freedoms (or "stochastic 
gauges" ) available in the noise matrix B will be considered, and a tally will be made 
at the end in Table 

4.4.2 Standard form of diffusion gauges for analytic kernels 

Let us consider the analytic kernel case, where it was seen above that there may 
be useful noise matrix freedoms. Such kernels tend also to be the most convenient 
generally because a positive propagator, and so a stochastic interpretation, is always 
guaranteed by the method in Section 13.4.31 

Since D = is square and can be made symmetric^, its matrix square root is 
also symmetric = , and can be used as a noise matrix Bq 



Bo = VD (4.64) 
D = BoBl (4.65) 



^Since d^/dvidv2 = d^/dv2dvi for any variables vi and V2 can be used in the FPE to achieve 
Djk ~ Dkj- 
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This square root form can be considered as the "obvious" choice of decomposition, 
unique apart from the Nz signs of the diagonal terms^. However, for any complex 
orthogonal O such that 00^ — I , if Bq is a valid decomposition of D, then so is the 
more general matrix B — BqO. So, any matrix in the whole orthogonal family 



B = BoO 



(4.66) 



is a valid decomposition. A general complex orthogonal matrix can be written 
explicitly using an antisymmetric matrix basis a^^''\ (j ^ k = l,...,Nz) having 
matrix elements 



^i^''^ = ^ji^kp - SjpSki- (4.67) 
With these {N^ — l)Nz/2 independent x matrices cr^^*^^ the general form is 



\j<k J 



(4.68) 



The Qjk are the Nz{Nz — 1)/2 complex diffusion gauge functions, which can in princi- 
ple be completely arbitrary, including dependence on all variables in C (not necessar- 
ily analytic), and the time variable without affecting the correspondence between 
the Langevin stochastic equations, and the FPE. 

As an example, in the case of two complex variables, there is one complex gauge 
function gii2, and the resulting transformation is 

O = exp(^i2a(^'0 

= cos(^i2) + (7(^')sin(^i2), (4.69) 

where the anti-symmetric matrix cr'-^^'' is proportional to a Pauli matrix: 



a 



(12) 



1 
-1 



(4.70) 



Hence, e.g. if the diffusion matrix is diagonal, Bq is also, and the transformed (but 
equivalent) noise matrix becomes: 



B 



12 



(4.71) 



'^One for each complex variable. 



Section 4.4 Diffusion stochastic gauges 



85 



The square form of B ()4.66|1 with O given by ()4.68|1 will be termed here the "stan- 
dard" diffusion gauge, to distinguish it from some other forms that will be discussed 
in later subsections. 

4.4.3 Real standard diffusion gauges and noise mixing 

Not all the canonical diffusion gauges gjk are useful. One class of useless gauges 
to avoid are those that seemingly change B, but effectively only swap around the 
linear combinations of noises dWk, without affecting dXj or hence any statistical 
properties of the equations. Their main effect is usually to complicate the math. 

Consider, for example, the two-complex variable case given by ()4.(j9j) . For a 
purely real diffusion gauge gi2 = the effect of the noise terms in the Langevin 
equations becomes 

= BodW. (4.72) 

This is just a rotation of the noises dW, leading to dW being a linear combination 
of the noises dWi and dW2- The new noises dWj have the same statistical properties 
as the old dWj. Thus dX{g[2) = B{g[2)dW has exactly the same effect in the 
equations as the un-gauged dX{g[2 = 0), regardless of any complicated form of g'12 
we choose to try. 

More generally, with Nz complex variables, there are at least Nz{Nz — l)/2 such 
useless noise rotations available, one for each pair of variables. Looking at the 
standard orthogonal matrix form ()4.68p . one sees that the terms in the exponential 
proportional to real parts of gauges gjk represent simply all these useless rotations. 

In a formalism with only real variables Cj, and no extra symmetries, diffusion 
gauges arise formally in the same manner as in Section I4.4.2| although all the ma- 
trices D, B, Bq, and O must now be real, making gjj^ also only real. This means 
that in such a case all the resulting real gauge functions in 0{gjk) are useless noise 
mixers, and no useful modifications of the stochastic equations can be achieved by 
choosing them. This explains why the potential of noise matrix choice to improve 
the stochastic equations went unnoticed for many years. 



dX = B{g[2) dW = Bo 



cos^Js sin 
-smg[2 cos^ia 



dWi 
dW2 
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This subsection can be summarized as 

Only the imaginary parts of the standard diffusion gauges ()4.68j) can 
affect the statistical properties of the stochastic equations. 

Lastly, it may be worth pointing out that real gauges can achieve the same useless 
noise rotations in non-square noise matrices as well. For example, for any arbitrary 
real function / the noise matrix 



B cos / B sin / 



(4.73) 



satisfies B^^^B^^^Y = ^ just as well as the original -B, whatever the form of the 
arbitrary complex function /. The stochastic increments are (for A^^ complex vari- 
ables) 



2N^ 



k=l 

= Bjk{cos fdWk + sin fdW^^+k) 

k=l 

= ^S.-.c^TV^, (4.74) 



k=l 



with the combined noises dW^. having the same statistical properties irrespective of 
the choice of /. 



4.4.4 Imaginary standard diffusion gauges and statistics 

The imaginary part of standard diffusion gauges g'^j^ = Im {gjk} do affect the statis- 
tics. Again consider the two- variable case of diagonal D of expression ()4.69jl . this 
time with imaginary gauge gu = igi2- 



dX = Bn 



cosh^f^'a isinhg'l^ 
-i sinh g'l2 cosh g'l^ 



dWi 
dW2 



(4.75) 
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For example, in the simplest case of diagonal positive real D, one has (using the 
notation dXj = dX', + idX'') 



dX[ = y/Du cosh g'l^dWi (4.76a) 
dX!^ = ^/D^ cosh g'l^dW2 (4.76b) 



dX'; = y/Diismhg'l^dW2 (4.76c) 
dX!^ = -^/Di2smhg';^dWi. (4.76d) 

While the covariance requirements (j3.50bjl are satisfied, the variances of the dXj 
components may vary depending on (7^2 • The nonzero correlations are 

((^^0'>,toch = ^osh' g';,{Du)^,,,^dt (4.77a) 

<(^^2)'>3toch = COsh2^?12(^22),tochrf^ (4.77b) 

<(^^")'>,toch = sinh^^?i2(^ii)stoch^^ (4.77c) 
((dX^')'>stoch = sinh^^7i2(^22)stoch^^ (4.77d) 
{dX[dX^)^^^^^ = -lsmh2g'lJy/DuD22) dt (4.77e) 

A \ I stoch 

(dX^dX('),toch = ^sinh2^i'2(\/A^) dt. (4.77f) 

Z \ / stoch 

Such imaginary gauges only apply when the model is expressed in complex vari- 
ables Zj with an analytic kernel. A glance at the canonical gauge form ()4.68|1 . indi- 
cates that there is then one such imaginary gauge g"^ = Iml^fj^} for each possible 
pair of variables. 

4.4.5 Non-standard diffusion gauges and further freedoms 

In Section 14.4.11 it was seen that there are degrees of freedom^ for the statistics 
of the dXj, but later, in Sections 14.4. 31 and I4.4.4l it was shown that there are at most 
Nz{Nz — l)/2 useful (imaginary) standard gauges gjk = ig'-^. Conclusion: there are 
some more freedoms not included in the standard square noise matrix expression 
()4.66p and ()4.68|) . The simplest example occurs when there is only one complex 
kernel parameter Zi. there is one degree of freedom in dXi, but zero standard 
gauges. 

*The case of a kernel analytic in complex variables is being considered all the time. 
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To see how the extra non-square gauges can enter the picture, let us consider 
the simplest A^^ = 1 case. Here there will be three covariance relations for dX[ 
and dX'^, but two conditions ()3.50|) on these. Firstly, note that while the standard 
gauged noise matrix B = BqO is square in the complex variables zj, in the real 
variables Zj and z" it is a 2Nz x Nz — in this case a 2 x 1. This leaves no room for a 
third degree of freedom in dXi = BdW. There is, however, no particular limit on 
the number of columns of B or the number of noises in dW, provided BB^ = D, 
so let us instead consider a 2 x 2 noise matrix. 




(4.78) 



where subscripts r and i denote elements relating to the real and imaginary parts of 
the complex variable Zi. Since there are only three degrees of freedom , we can set 
one of the elements to zero with no restrictions on the statistical properties of dXi 
(say b^i = 0). The remaining elements must satisfy the conditions (from 

(Re dt = {{dX[f - (rfX(')^).,„,, = dt {hi - 6?, - 6f,>^^^^^ (4.79a) 

(Ini {I^}),toch dt = 2 = dt , (4.79b) 

leaving room for one arbitrary function (i.e. gauge) out of the three nonzero elements 
6rr, b^i, or bii. In the standard noise matrix form from ()3.54j) one had ha = 0, so all 
the noise matrix elements were then completely specified by ()4.79|) . with no room 
for a gauge. 



4.4.6 Distribution broadening gauges 

Consider the case of no diffusion D = 0. When the kernel is analytic in complex 
variables, there can actually be nonzero noise matrices, leading to nonzero noise 
despite no diffusion in the FPE. All that is required is that the complex B obeys 
BB^ = 0, or in terms of stochastic increments dX = BdW that 

(^^ ■ rf^^>stoch = D = (4.80a) 
(rf^)stoch = 0. (4.80b) 
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A viable noise matrix is, for example, 



B 



9 ^9 



(4.81) 



where ^ is a A^^ x complex matrix, all of whose elements can be any arbitrary 
functions (gauges) we like without affecting BB^ = D = 0. There is no limit on 
the width Nw- For nonzero diffusion, the same kind of gauges can be attached to 
an existing noise matrix {BB^ = D) at one's leisure via 



B 



B B 



(4.82) 



The new noise matrix also obeys BB^ = D. Nonzero noises corresponding to zero 
diffusion in the FPE are possible because the correspondence between stochastic 
equations and the FPE is exact only in the limit of infinite samples. Thus the 
conditions ()4.8()j) apply only in the limit of infinitely many trajectories, and B = 
is just one special case in which these conditions are satisfied for every trajectory 
on its own. 

The diffusion gauges g will be termed "distribution broadening" here because 
their effect is to make the distribution of the complex variables zj broader, while 
preserving their complex means and mutual correlations. Generally such broaden- 
ing gauges are not of much use, simply making everything more noisy and reducing 
precision of observable averages, but there are some situations when this is advan- 
tageous. 

One example occurs when the noise matrix is singular, or there is no native 
diffusion in the FPE (i.e. dXj = 0) for a variable Zj. In this situation drift gauges 
are unable to make modifications to the deterministic evolution. If one, however, 
uses a broadening gauge to force a nonzero noise matrix, then the usual drift gauge 
formalism can be used and arbitrary modifications to the deterministic evolution 
made. The idea of creating additional drift gauges in this manner was first proposed 
(in a Nz = 2 case) by DowlingjOH]- Let us see how this proceeds in a general case: 

For clarity in the resultant equations it is best to define a diagonal broadening 
gauge matrix g with elements 

9jk = gjSjk- (4.83) 
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Adding the broadening gauge to a pre-existing noise matrix B as in ()4.81|) and ()4.82|1 . 
one obtains a A^^ x 3Nz noise matrix and SN^ independent real noises. Each of these 
noises can now have a drift kernel gauge attached to it in the manner described in 
Section For the purpose of arbitrary drift manipulation it suffices to introduce 
Nz complex drift gauges Qj on just Nz of these noises — say on the noise matrix 
elements gj. This leads then to the stochastic equations 

dzj = Aj dt + Y^ BjkdWk + gj (dWj + idWj - Qj dt) (4.84a) 

k 

dzQ = Aodt-^Y,^l^^ + Y.^^^^r (4- 84b) 

j j 

The "broadening noises" dWj and dWj are independent real Wiener increments 
just like the dWk- 

The arbitrary gauge modifications to the drift of a variable 

- gjQj dt. (4.85) 



A related situation occurs when the native stochastic increment dXj for a variable 
Zj is small. In such a situation, making a significant change in the drift of Zj 
would require large compensation in the log- weight zq, leading to rapidly increasing 
statistical uncertainty or even bias from widely varying trajectory weights. However, 
if one introduces a broadening gauge such that 

var Re{gj}dWj > vai [Re {dXj}] 

(Rea}'>stoch » E(l^^{^^-^}'>stoch' (4-86) 

k 

then the drift of Re{2;j} can be modified at smaller weighting cost then if one had 
used standard drift gauges. Analogously for lm{zj}. In fact, there appears to 
be a tradeoff here between the amount of noise introduced into the log-weight zq 
(proportional to Qj), and the amount of noise introduced directly into the variable 
whose drift is being modified (proportional to gj). 
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4.4.7 Diffusion from different physical processes 

A commonly occurring situation is that several physically distinct processes give 
separate contributions to the diffusion matrix, e.g. 

D = ^Z}(0. (4.87) 

I 

Calculating the square-root noise matrix Bq may, in some cases, give a very compli- 
cated expression in this situation. If this is a hindrance in calculations, a much more 
transparent noise matrix decomposition is possible. One decomposes each diffusion 
contribution Z^^'^ separately into its own noise matrix as 

= 5«(E«)^. (4.88) 

and then combines them as 



B 



(4.89) 



This results in separate noise processes for each diffusion contribution D^''\ and 
formally separate drift gauges (one complex drift gauge per real noise). The benefit 
of doing this is that the resulting stochastic equations have stochastic terms of a 
relatively simple form. On the other hand, a possible benefit of doing things the 
hard way with Bq a direct square root of the full diffusion matrix D is that diffusion 
contributions from different processes / may partly cancel, leading to a less noisy 
simulation. 



4.5 Summary of standard gauges 

The standard gauge choices that will be used in subsequent chapters can be sum- 
marized in vector form by the equations 

dz = Adt+ BoO{{g}){dW -gdt), (4.90a) 
dzo = Aodt + (^dW -^gdt^ . (4.90b) 

This uses a kernel of the form ()4.9|) proportional to a global weight Q = e^°, and 
analytic in A^^, + 1 complex variables Zj, zq. A weighting kernel gauge, standard drift 
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kernel gauges, and standard imaginary diffusion gauges have been used {gjk = ig'-^ 
forming the set {g} of arbitrary real- valued gauge functions). Underlined quantities 
are those obtained from the FPE before introduction of drift gauges, while the bold 
quantities denote column vectors with an element per base configuration variable 
Zj^Q. And so, A and Aq are drift coefficients from the un-gauged FPE, dW are 
real Wiener increments (with each element dWj statistically independent for each j 
and each timestep, of zero mean and variance dt), and Q are arbitrary complex drift 
gauge functions. B_q = is the square-root noise matrix form, O is the A^^ x 
orthogonal matrix given by ()4.(i8j) . but in this case dependent only on imaginary 
gauges ig'ji^. The gauge freedoms in this standard formulation are summarized in 
Table Ol 

Table 4.2: Tally of diffusion and noise matrix freedoms in the standard formulation 
14.90|l : Kernel analytic in complex phase-space variables fSection I3.4.3|l . standard imaginary dif- 
fusion gauges (Section I4.4.2|l . standard drift gauges (Section |4.3() . 





Number 


Kind 


Base phase-space variables C_ = {zj} 


A^. 


complex 


All variables C = {zq, C_} 


N, + l 


complex 


Noises (Wiener increments dWk) 




real 


Drift gauges Qk 

Imaginary diffusion gauges gjk = ig'-^ 


N,{N, - l)/2 


complex 
imaginary 
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Table 4.3: Tally of diffusion gauge and noise matrix freedoms. The degrees of freedom 
counted are always real, not complex. 





ivei iiei aiiaiyLic iii 


^oreiieiai Keiiiei 


IN UlliUt;! Ui. 


l\i / \ "V y 1 T-v 1 cy\r \ rQ T"! Q r~t 1 £1Q 
i V 2 L^UllipicA. VcLi IcLUlco 


111 iV^ itjcLl VcLilcLUlcb 


Real variables 




AT" 


bovariance relations between aXj 


AT / C\ AT 1 -l\ 

N^{2N^ + 1) 


A 7 / A 7 1 -1 \ /O 

A^^(A^^ + l)/2 


Covariance constraints on dXj 


A^^A^^ + 1) 


A 7 / A 7 1 -1 \ /O 

A^^(A^^ + l)/2 


T -\ I J * 1 1 r 1 

rotentially useml 






statistical freedoms between dXj 


iV 

2 


U 


maximum Wiener increments dWk 


oo 


OO 


J I 1TTT* * i IT T T 

standard Wiener increments aWk 






Elements in standard square root 






noise matrix Bq = \/~D 


O A 79 


TV 

V 


blements m extended noise matrix 






with real columns 




iV 


uanonical eauees Qji. m (|4.d8P 


A 7 / A 7 1 \ 


AV,(A/^ - l)/2 


— useless real gauges g'^^. 


iV,(iV, -l)/2 


iV,(iV, - l)/2 


— useful imaginary gauges g"^^ 


iV,(iV, -l)/2 





Potentially useful non-standard gauges 


N,{N, + l)/2 





Useless potential noise spawning gauges 


oo 


oo 


Potential broadening gauges in g 


oo 


not applicable 


Broadening gauges g^ in 






drift gauge scheme ()4.84|). 




not applicable 



94 



Chapter 5 The gauge P representation 



Chapter 5 

The gauge P representation 

The gauge P representation, which will be used in subsequent chapters, is explained 
and its properties investigated here. It is based on the positive P representation of 
quantum optics, which uses an off-diagonal coherent state kernel, but due to the 
inclusion of a global phase can be used to introduce weighting and drift gauges. The 
original distribution concept is due to P. D. Drummond, and its basic derivation is 
given in published work by Drummond and Deuar [U^l 13^ IHT] . 

The positive P distribution is a promising starting point because it has al- 
ready been successfully used in some many-body problems in quantum optics (e.g. 
squeezing in optical solitons [121 HI]) and with Bose atoms as well (evaporative 
cooling ^5 ^1 Uni)- The coherent-state mode-based approach gives simple equa- 
tions similar to the mean-field Gross-Pitaevskii (GP) equations, and automatically 
applies to open systems. However, stability and systematic error problems can 
occur [701 EH EH inni with these simulations, which hinders effective use of the 
method. The gauge P representation inherits all the useful features of the posi- 
tive P but allows drift gauges, which can be used to tailor the equations so that 
stability problems and systematic (boundary term) errors are removed. Weighting 
gauges allow one to also perform thermodynamic simulations of grand canonical 
ensembles by including an evolving trajectory weight. 

Stochastic equations for the interacting Bose gas model are found in Sections 15.31 
and 15.61 Extension to non-local interparticle interactions is derived in Section 1^31 
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5.1 Properties of the representation 

The representation uses an un-normalized (Bargmann) coherent state basis. On a 
subsystem j this basis is 



)j = exp (a^ajj |0) = exp (^-^^^ (5.1' 



with the complex amphtude aj, and mean particle occupation (rij) = {ajl"^. As usual, 
the boson annihilation operators at subsystem j are aj, and obey the commutation 
relations 



aj , al 



: 6,k. (5.2) 
The basis states are mutually non-orthogonal 

{f3*\\,\\a,)k = 5,ke"^^^, (5.3) 

(from ()3.30p ) and overcomplete (from ()3.3H) ). 

The kernel (on separable subsystems) is chosen to be 

A = e-<.,A,, ^^^^^ 

Aj = \\aj)j{(3j\\j exp{—ajf3j) . 
If one defines coherent amplitude vectors ex. and /3 to contain all A^ elements aj 
and Pj, respectively, then the full variable set is C = {^o, ck, containing 2N + 1 
complex variables. The non-orthogonality of the basis coherent states allows nor- 



malization Tr 



A 



apart from the complex global weight = e^°. The kernel 
is also seen to be analytic in all complex variables aj, Pj, and zq, so the procedure 
of Section 13.4.31 can be used to ensure a stochastic realization for any FPE. 

As per expression ()3.ip . the density matrix (possibly un-normalized) is expanded 

as 

p^ = J Pa{C)A{C) d^^'ot rf2^/3 d^zo (5.5) 

with the gauge P distribution Pg{C). 

Now, it can be verified by expansion that 



= ^jk(^j\Wj)j (5.6a) 
dai 



d 

= (5.6b) 
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This then leads to the basic kernel operator correspondences 





= ajA, 


a]A 


= [f^^ + 




= (^aj + 


Aa] 


= f3,A 



d 
d 



(5.7a) 

A, (5.7b) 

A, (5.7c) 

(5.7d) 



in the form ()3.39p Together with 

^A = A. (5.8) 

which is seen to apply by inspection of ()5.4|1 . these operator equations can be used to 
obtain observable moment estimates and stochastic equations using the procedures 
of Sections EM EM EM 

The similarity of the gauge P to the positive P representation allows one to 
adopt some exact results obtained for the latter. Any state that has the positive P 
representation P+(q;,/3) can be represented by the gauge P representation 

PG{zo,cx,(3) = 6\zo)P4(x,f3), (5.9) 

by inspection of the kernels ()5.4p and ()3.6|) . (This is not a unique correspondence, 
but the simplest of many.) 

Since it has been shown that any quantum state p has a positive P representation fH] . 
it follows from ()5.9p that all quantum states must also have gauge P representations. 
The constructive expression ()3.10|1 for a positive P representation of arbitrary p can 
be substituted into ()5.9j) to obtain a similar expression for the gauge P. 



5.2 Observables 

Calculation of observables proceeds in a very similar manner to the positive P repre- 
sentation in Section 1!^. 3. 5[ All operators on supported states can be written as linear 
combinations of the moments of the local annihilation and creation operators cij, aj. 
Thus, to evaluate any observable it suffices to know how to evaluate an expectation 
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value of a Hermitian operator with two adjoint separable terms of the form 

Q{{Lj, L'^], ^) = Y ®^ "'Lj ®^ ^ife + ^ ®fc oL'^ ®j aiy (5.10) 

6^ is a phase, the Lj and L'^ are subsystem labels (not necessarily unique), while the 
j and k are "subsystem label counters" . For example the particle number operator 
for subsystem p has j = k = {1}, Li = L[ = p, 9 = 0; while a quadrature operator 
q{9) = |(a'l"e*^ + ae~*^) on subsystem p has j = {1}, k = {}, Li =p. 



Using ()5.7|) and Tr 



A 



e^°, one then obtains 



Tr 



QA 



T n n (^^, + T'-p] ? n n U ^^y^ m 

k j ^ ^ ' j k \ fc/ 

^ n n + n n • (^-n) 

j k k j 



Similarly, 

Tr 



j k k j 



These can be entered directly into the general observable estimate expression ()3.14|) 
giving 

oiiL L'\e) - (M^-"(-"n../3.,^..+e-^n../^,,^,j})_^ 

{Q{{L„L',},9)) = lims^^Q{{Lj,L',},9). 

Comparing and one sees that the same form appears in both, and 

so an algorithm to determine the finite-sample estimate of the expectation value of 
an arbitrary observable (O) can be given: 

1. Normally-order O, using ()5.2|) by placing all annihilation operators to the right. 

2. The numerator of the stochastic estimate O is formed by replacing a^, at, and 
® in the normally-ordered expression for O by a^, and Yl^ respectively, 
and finally multiplying by the global weight = Q. 

3. Take the average of the real part of the numerator terms, and divide by the 
average of the real part of the weight (Re {e^°})g^Q^j^. 
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In cases where the normahzation of p is known to be conserved (e.g. dynamical 
master equations ()2.2()|1 . but not thermodynamic ()2.29j) ). the average in the denom- 
inator can be dropped, because it is known to always be equal to one in the limit 
of many trajectories. Its variations about unity are an indication of the sampling 
error in the calculation, but are not necessary to obtain expectation values. 

The above algorithm applies also when O is an infinite sum of moments, such as 
for example the parity operator at subsystem k, which can be written as 



(5.14) 



n=0 



in terms of Fock number states |n)j^. with occupation n at subsystem k. Since 
\n)f^ = (l/-\A?T)(aJ.)" |0), the one obtains 

(Re{exp {zq- ^akPk)}) ^toch 



lim 

<S— »oo 



(5.15) 



With such operators of infinite order in annihilation and creation operators, one 
should, however, be wary of boundary term errors (of the second kind), as explained 
in Section IB. 2. 1[ 

Finally, for estimates of fidelity using expression ()3.22|) . one requires the trace 
of kernel products. Using ()5.4|) and the properties of Bargmann states ()5.3|) . one 
obtains 



Tr 
Tr 



A(ai,/3i,l]i)At(a;,/32*,fi;) 



^1^2 exp [-(cKi - as) ■ (/3i - /Ss)] (5.16a) 
fiifi; exp [-(cKi - (3;) ■ (/3i - «;)] (5.1j6b) 



5.3 Dynamics of locally- interacting Bose gas 

The master equation for the dynamics of a locally interacting Bose gas on a lattice 
is ()2.20j) with the Hamiltonian given by ()2.17|) . Using the operator correspondences 
()5.7|) . and the methods of Sections 13.4.11 13.4.31 one can obtain an FPE for this 
system. Subsequently using the methods of Section 13. 4. 2^ the square root form 
of the noise matrix B_ = B_q = and standard drift gauges, the stochastic 

equations given below are obtained from ()4.25|) and ()4.26|) (or more directly, from 
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()4.9n|l . setting gjk = 0). Diffusion gauges have been omitted at this stage (so, 
0{gjk) = I) for clarity. Note that the M spatial modes are now labeled by the 
lattice labels n, or m as defined in Section 12.31 Each mode is a "subsystem" . 

It was chosen to separate the noise contributions from each process, as discussed 
in Section [4.4.71 This is done so that the noise terms take on a simple form. 

With no environment interactions, the purely Hamiltonian evolution is found to 
lead to the Ito stochastic equations 



da^i = —i'^^ujnm'^mdt — 2ixa'^Pndt + ian^/2ix{dWn ~ Qn), (5.17a) 

m 

C?/5n = i5^^mn/5mC?t + 2ixan/5nC?t + /5nV^(rfW^n-^n), (5.17b) 
m 

dQ = [^"^^" + ^"^^"] ' (5-1'^^) 

n 

with 2M independent real Wiener increments dW^ and dWn- The corresponding 
complex drift gauges are and Q^- 

Addition of a heat bath at temperature T results in the following additions: 



dan = yanrft + \/7nnbath(T)((ir7n - G^^) (5.18a) 

dPn = y/^nf/t + VlnUbMTKdr]*^ - G^n^) (5.18b) 

d^l = ■■■ + ^Y^ [^^nS'^ + dr,*g^^^'\ , (5.18c) 



where ribath is given by the Bose-Einstein distribution ()2.23|1 . Here the complex 
noises drj-a are independent, and satisfy 

{dVndv*Jstoch = ^r^mdt, (5.19a) 

{dVndVm) ^toch = 0' (5.19b) 

(^^n)3toch = 0. (5.19c) 



The drift gauges ^n^^ and Qn^ are also complex. 

Two-particle losses to a zero temperature heat bath result in the following addi- 
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tions to the dynamical equations: 

da^ = ^^^^al(3ndt + ian\/^ {dW^^^ -g^^^), (5.20a) 

c^/3„ = ■■■-7^^^a^Pldt + iPn^^ [dWi^^ -g^^^y (5.20b) 

dn = ■■■ + nJ2[s^n^dwi'^ + g^^'>dwi'^]. (5.20c) 

n 

(2) ■ (2) 

Here, again, the noises dWr and dWr are independent Wiener increments, and 
the drift gauges gn^ and gn^ are complex. 

Lastly, A coherent driving field leads to the deterministic corrections 

dan — 1- sdt, (5.21a) 

d(3n = ■■■ + s*dt. (5.21b) 

The numerical simulation strategy is (briefly) 

1. Sample a trajectory according to the known initial condition Pg(0) = Pg{ p(0) ) 

2. Evolve according to the stochastic equations, calculating moments of interest, 
and accumulating appropriate sums of them. 

3. Repeat for <S ^ 1 independent trajectories. 

One could equally well remain with 2M x 2M complex noise matrix, which 
would use only 2M real noises and 2M drift gauges, instead of the 6M real noises 
and 6M complex drift gauges above. In general, however, the stochastic terms 
would then have a complicated dependence on the parameters X; ''^bath, 7n, and 7n 
— possibly undesirable. On the other hand, the amount of noise in the simulation 
might be reduced, because the effects of one process may partly cancel the effects 
of another. This may be particularly so if both scattering oc x two-boson heat 
bath interactions oc 7^^^ are present. These two processes lead to similar terms in 
the equations (apart from factors of i, etc.), and may be expected to cancel some 
noise without causing excessively complicated equations. 

Some brief comments about the behavior of the above equations: 

• The (un-gauged) equations for dp^ are simply the complex conjugates of da^, 
apart from possessing independent noises. 



Section 5.4 Comparison to Gross-Pitaevskii semiclassical equations 



101 



• The Hamiltonian evolution ()5.17|) leads to a nonlinear equation in the coherent 
amplitudes, with obligatory noise. 

• All interactions with a heat bath (irrespective of the temperature) cause a 
deterministic exponential loss of particles. 

• Finite temperature thermal interactions cause a directionless randomization 
of the coherent amplitudes ck and /3, leading also to a mean growth of boson 
numbers. 

• Two-particle losses also lead to nonlinear equations and noise. 

• The noise from scattering and two-particle losses is directional in phase space, 
as opposed to thermal noise. 

• Coherent gain causes no noise in the equations. 



5.4 Comparison to Gross-Pitaevskii semiclassical 
equations 

The lossless equations ()5.17|1 are similar in form to the Gross-Pitaevskii (GP) semi- 
classical equations ubiquitous in calculations on Bose-Einstein Condensates at tem- 
peratures well below condensation. (For details of these gases and results that are 
obtained with the GP equations see e.g. the review by Dalfovo et a/ jSI].) Derivation 
of the GP equations basically assumes the particles all coherently occupy a single 
orbital, and can be described by its wavefunction. 
In fact, if one 

1. Ignores stochastic and drift gauge terms in ()5.17|) . 

2. Makes the assumption that the field is coherent — i.e. the local kernels are 
coherent state projectors An — > |an)n ('^nln^ implying 



13^ a*. 



(5.22) 
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Note that this is consistent with the first assumption of no stochastic terms, 
since djSn differs from rfa* only by having independent noises. 

3. Makes a variable change 

such that ip is the order parameter (i.e. the wavefunction normalized to = 
J iplx.) (P^, where is the mean particle number): 

then precisely the (lattice) GP equations for ip{^ri) are obtained. From this, it 
can be surmised that in regimes where the GP equations are a reasonably good 
approximation, the noise will be relatively small and calculations precise. 

It is quite remarkable that just by the addition of simple stochastic terms, full 
first-principles quantum evolution is recovered from a mean field theory. This con- 
venient property suggests that full quantum simulations using this method have the 
potential to remain numerically tractable. 



5.5 Extended interparticle interactions 

Equations for the case of extended interparticle interactions as in the Hamiltonian 
()2.12|1 will be derived here. Proceeding as in Section 15. 3[ the complex diffusion 
matrix (before introducing any drift gauges) in the FPE can now be written 

i^H 

D(^) 

where D^") appears in the FPE as hD^^ld"^ / da^^dar^, and D^^) as W^nldydf3ndf3„ 
Their matrix elements are given by 



D 



(5.24) 



h 

■ ^nm 



h 



(5.25a) 
(5.25b) 



Since the interaction potential Unm is symmetric, then it could be orthogonally 
decomposed by its matrix square root v, which satisfies 



vv = u = vv 



(5.26) 
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The matrix v could be calculated at the beginning of a simulation and subsequently 
used in noise matrices satisfying B^^^B^^'^]'^ = D^^^ (with z taking on the labels a 
or where 

BlSl = A/^ani^nm (5.27a) 
V n 

Bi^l = y|/5nt^nm. (5.27b) 

Unfortunately finding the matrix square root would usually need to be done numer- 
ically, which would require storing O (M^) matrix elements and take a time 
O (M^) to compute — not efficient for large lattices. 

A much more efficient, though involved, approach is possible. One would like to 
obtain some orthogonal decomposition of D dependent directly on that depends 
only on the interparticle spacing. This potential has M elements, rather than the 
M X M of Unm- Writing the Langevin equations in terms of stochastic increments 
dXi'^ (See Section E33) as 

dzn = Al^Ut + dXi'\ (5.28) 
the stochastic increments must obey 

(dX^'Ux^^)^^^^^ = Di^J^dt (5.29b) 
{dX'n^dX(^Xo., = 0- (5-29C) 
Firstly, we can define new stochastic increments dY^"^ and dYn'^\ such that 

dX^""^ = an^^dV^"'^ (5.30a) 

dXi^^ = f3^J^dY(^\ (5.30b) 



and these must obey relationships (Remembering from ()2.15|) that = = 

t^|n-m|), 

(rfF^^)dll^)>^^^^^ = f/|n_^|rft, (5.31b) 
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which now depend only on the a-priori potential Un, not on the dynamically evolving 

(z) 

mode amplitudes. Note that while the variances of the dYA ' are the same for both 
choices of z, the actual realizations of the increments must be independent. 

It is useful to now consider the P-dimensional discrete Fourier transform of the 
interparticle potential: 

U~n = Uk + ^k = 77^ E f^ne-^''^-^", (5.32) 

n 

with the normalization constant Cnorm = (2vr)-^/^/ Ax^. All the lattice notation 
used from here on has been defined in Section 12.31 Note now that since [/„ is real, 
and k n = — kn, then ()5.32j) implies 

Uk = f/iji (5.33a) 

Uk = -U'L^. (5.33b) 

The inverse transform is 

^n = ^X^^He^^-^". (5.34) 

n 

Expanding out the elements of lattice coordinate vectors, one can write |xn — Xm| = 
{di/S.xiirii — nil), ... , 'dv^xvin-D — "^x))}, where the quantities 'dd can take on the 
values +1 or —1, depending on what is required to take the modulus. Recalling the 
symmetry property of the interparticle potential that was assumed in Section [2 
(^{ni,...,nd,-} = ^{ni,...,Md-nd,...} ^oi any dimension rf), one obtains 



L^{ni,...,nd,...} — "ii 

ii d 

a 



'norm 



U^^-~ikd{nd)Xd{nd) JJ' ^ik^, {n^,)xj^, (n^,) ^ (5.35) 



M 

since explikdMaAxd] = 1 for any k^. So, the phase for dimension d can have either 
sign. This result can then be applied to each dimension where "^d = —1 to obtain 
(using also Xn-m = Xn — Xm) the expression (subtly different from ()5.34|1 ) 

Uin-mi = %^ E ^ne^'^-(--''-). (5.36) 



Section 5.5 Extended interparticle interactions 



105 



If one introduces new stochastic increments dZn \ then using ()5.36|) . condition 
()5.31b|) is equivalent to 



a 



{dY^'^dY^X.^^ = ^E^^'"''"^"^'"'''"V^Ht/s.(rf4^<^^) (5.37) 



M ^ V " - \ n m /^^^^jV 

nm 

provided that 

'rfZi^A = (5.38a) 



stoch 

^(z) Jv{2) 



dZ-'dZ'^') = 6nii,dt. (5.38b) 

stoch 



This can be checked by substitution. This is not yet quite what one wants to be 
able to decompose into dYn^^ because the second phase in ()5.37j) has the wrong sign. 
Again using — = k_H, to relabel m — m, and applying ()5.33j) one has 

(dY^'^dY^^)^^^^^ = ^5^e^i^--e^''--V^^n,-.rft. (5.39) 

nm 

The phase factor is now fine, but the complex conjugate spoils the potential 
decomposition. What is needed are some stochastic increments dZ~^^ that will satisfy 



(dZi'A = (5.40a) 

\ / stoch 

dZi'^dZ^l^) = 6n,-ri.Undt, (5.40b) 

/ stoch 



to allow then a decomposition 



dY^^^ = ^^Y.^^--''dZ^^^- (5-41) 

n 

This can be achieved by separating out the real and imaginary parts of U as 

dZi'^ = V^^C^^ + \l^^dCi'\ (5.42) 
with the new stochastic increments 

>n 

tdCi^ 

:^C?^^S^> = ^n,-indt (5.43c) 



^Cr> , = =0 (5.43a) 

/ stoch \ / stoch 

dC~'dCi>) = 5^,^^dt (5.43b) 

stoch 



dQ'dO = 0. (5.43d) 

stoch 
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Note that yU~U"~ = iU~ by ()5.HH|) . Let us divide the n 7^ momentum mode 
space into two symmetric halves TZ and TZ, such that e.g. when V = 3: 



n G 7^ if < 



ni > 
or, ni = 
or, rii = 



and ^2 > 
and ?23 > 



(5.44) 



while n G 7?. if n ^ {TZ, 0}. The c/Q noises can now be realized by the construction 



(^) 



(5.45) 



(rfWji,i + idWn,2)/V2 if fi G 7^ 
dWo^i if fi = 

{dW_n,i - idW^i,^2)/V2 if fi G 

in terms of real independent Wiener increments dWnj, for all n G {71, 0}, numbering 
M in total^. d(^^ requires separate M independent noises dW^^^ and dW^^^. 
Collecting all this together, one obtains 



dX'^^ 

dxiP^ 



/3. 



hM 



'md&^ + Jui!d&^ 

n ' y n 



hM 



J2 e'*"" ' 



W) 



m 



(5.46a) 
(5.46b) 



All four and complex noises per (Fourier space) lattice point are independent 
of each other, and of the noises at all other (momentum) lattice points and times. 
Explicitly they have the form ()5.45|) in terms of the 4M real Wiener increments^ 
dWnj. Storage space for M complex variables f/^ is required, and calculation of 
these^ takes a time oc M log M — much more tractable than the calculations of f nm- 
The dXn^ reduce to the noise terms of ()5.17p under local interactions [/„ = 2hxSn,o- 
With no drift gauges, the two-particle interaction terms of the Ito stochastic 
equations become (the terms due to other processes are unchanged) 



day 



dPr. 



^ h 



f^|n-m| 

h 



Pna^Pm + dXi^\ 



(5.47a) 
(5.47b) 



+ 1 if M is even. 
^Or 4(M+ 1) if M is even. 
^Using the "fast Fourier transform" algorithm. 
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Drift gauges can be introduced by making the replacements 

dW^j ^ dW^j - dt, (5.48) 
in ()5.45j) (this follows straight from the standard form ()4.9()|l ) and 

dn = --- + n^Y^ Sn,jdWn,j. (5.49) 

n j 

Simulations of such models pose no fundamental problem (see e.g. Sections 110.51 
and 110.6]) . however the required computer time scales more steeply with M. There 
are now 2M complex terms to calculate in the equation for each variable rather than 
the 2 for a local interaction model ()5.17j) . 



5.6 Thermodynamics of interacting Bose gas 

The grand canonical thermodynamics of a system with Hamiltonian H in thermal 
and diffusive contact with a reservoir at temperature T and chemical potential /i 
can be simulated using the master equation ()2.29j) . 

With the locally- interacting Hamiltonian ()2.17|) . using the operator correspon- 
dences (15. 7p . and the methods of Sections IS3II1 E3!S1 one can obtain the FPE, and 
then directly by ()4.90|1 (not using diffusion gauges here, so gjk = 0) the stochastic 
equations. Using the same notation as in Section ESI these can be written 

dan = — /l^^CUnm«m'^'r + (/ie — 2/zXan/5n) ttn'^'T + ^anV^2^(rflVn — ^n) (5.50a^ 
m 

dPn = 0, (5.50b) 



dn = n 



5.50c) 



There are M real Wiener increments dWn, and hence the same number of complex 
drift gauges Qn- 

The asymmetric form (no evolution) arises because the Kamiltonian acts 
only from the left on the density matrix in ()2.27|1 . The initial jSn takes on a range of 
random initial values (see ()5.59|) below), which then remain constant. A symmetric 
set of equations is also possible by starting from the middle (anticommutator) term 
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of ()2.27|) . however this appears to serve no useful purpose but needs more noises 
(another set of M for the f3a evolution). 

The initial condition ()2.3H) is p„(0) = exp —XnN = (>?>nPn(0), where A„ is given 
by (j2321), and with 

Pn(0) = exp [-Xnalfin] ■ (5.51) 

One could use ()5.9j) and (jH.lOj) to obtain a gauge P distribution corresponding to 
this initial state, but a more compact distribution can be found as follows: 

Since Pu{0) is separable, let us just consider the initial conditions in a single 
mode n (all modes are in this same state initially). In a local Fock number state 
complete orthogonal basis \n), the initial state can be written 



Pn(0) = exp [-XnUn] -^n = ^ ^f~^" ^ '^^ ^ 



n\ 



= ^e-^""|r2) (n| . (5.52) 

n 

The local kernel ()5.4|) , on the other hand, expanding the Bargmann states, is 

A„ = e"-^" |0) (0|e^"^"e-""^" 

= J2-^\n) {n\e-''-^-. (5.53) 



nn 



<P2 

'n\n\ 

Since the high temperature state should be classical let us try a gauge P distri- 
bution over just diagonal coherent states and so postulate a = f3*. Also, at high 
temperature the modes should be separable, so let us try a distribution where the 
amplitudes at each mode are independent. From ()5.51|) . the state of each mode 
should be identical, and lastly, for simplicity, let us choose the initial weight of each 
trajectory to be equal: f2(0) = e^°*^°^ = 1 Let us start, then, with the Gaussian 
ansatz 



P'J'icx, f3, zo) = S\zo)6''\f3 - a*) n ^ 



(5.54) 



To see whether this is sufficient to represent the initial state, and to find the value 
of cr, let us substitute into ()5.5|) and see if ()5.52|) can be satisfied. Integrating over 
Q and /3, and separating modes, one has (for each mode n): 
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Writing = re*^, one has 

nn 

1 1 / \ 



1+1 



and using^72] r'^^'+^e-f^ dr = n\/2f "- 

One wants to have pn^{0) = pn(0)/Tr [pn(0)]. Since Tr [pn(0)] = 1/[1 - e"^"], this 
imphes 

^ = . ^ = ^/f , (5.58) 

(jiQ = l/[e'^" — 1] is the mean occupation per mode). So then, it has been verified 
that a gauge P distribution for Pu{0) is just a Gaussian in ct. 



Pg(«, /3, ^o) = 5'(^o)^'*'(/3 - «*) n ^ exp ( ^ilL ) . (5.59) 



Tcno \ no 

This is easily sampled to obtain initial values of = -^o = for each 

trajectory. 

For the case of extended interparticle interactions Unm = f^|n-m|; a similar pro- 
cedure can be followed as was done for the dynamics in Section l575l The diffusion 
matrix in the FPE is now 

D^nl = -^nmanam, (5.60a) 
D^^l = 0, (5.60b) 

rather than ()5.25|1 . With no drift gauges, the Ito stochastic equations are 

dan = — /i uJnmCem dr + iXeCHn dr — U\n-m\cenCimPm dr + dX^^ (5.61a) 



m 



d(3n = 0, (5.61b) 



dQ = Q 



nm n \ m 



dr (5.Qlc) 
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With the stochastic increments given by 

= ^any^E^^'"^"{y^^Cr + VWrfS"^}• (5-62) 

n 

Here the noises d^^'' and c^Ch"'' given again by ()5.45p . but the 2M real Wiener 
increments dW^j now have variance dr instead of dt. Drift gauges can again be 
introduced by making the replacements 

dW^j ^ dW^,, - g^,, dr, (5.63) 

in (lOKl) and 

5.7 Comparison with historical distributions 

To put the gauge P representation in perspective, let us compare to the more com- 
monly used phase-space representations from the field of quantum optics where 
phase-space distributions have arguably been most used. This section is based on 
Section 2 of the published article by Drummond and Deuar|nSl- The concepts and 
general layout of this section are due to P. D. Drummond. 

To understand the reasons for development of the various distributions, it is 
useful to peruse the requirements listed in Section ITHl that they have to satisfy to 
result in stochastic simulations of many-body models. 

Phase-space mappings were first introduced by Wigner as the famous Wigner 
representation's 73. Historically, phase-space distributions have developed in three 
stages: 

• Stage one (classical-like phase space): A classical-like phase space was 
used in which the number of real configuration variables in C was the same 
as the number of classical degrees of freedom. Typically the kernel is of the 
form of a diagonal projector, and not all quantum states can be represented by 
positive nonsingular P. This usually manifests itself as either a non positive- 
definite propagator or 3rd order terms in the FPE in master equations in- 
volving several-body processes. Either way, a quantum-equivalent stochastic 
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process is not recovered. The Wigner|57j. Husimi Q[ZH], Glauber-Sudarshan 
P|SHl mi EH], and the Poisson[7H | ITH j IH^ representations, among others, all 
fall into this category. 

• Stage two (doubled phase space): By the use of off-diagonal kernels, rep- 
resentations in a higher- dimensional phase space were developed, for which a 
non-singular distribution P exists for all quantum states. These typically have 
at least two real configuration variables for each classical degree of freedom 
(one for the ket and one for the bra in the kernel). Examples of such represen- 
tations are the Glauber RlHHl IZ1| and the positive P[nillII] representations. 
While these work very well for highly damped systems, in models with several- 
body processes and low damping, they typically develop unstable trajectories. 
This leads to large sampling uncertainties or even systematic "boundary term 
errors" HOI HI]. 

• Stage three (global amplitude and stochastic gauges): Addition of a 
global weight to the kernel allows the introduction of drift stochastic gauges as 
in Section lOl and (by appropriate gauge choice) modification of the stochastic 
equations to remove the instabilities. The gauge P representation introduced 
above is of this type, as is the stochastic wavefunction method of Carusotto et 
a/piinni- This work of Carusotto et al is in some ways complementary to that 
presented in this thesis, and has been developed approximately in parallel. One 
fundamental difference between the two representations is that the gauge P 
representation allows the particle number to vary, allowing open behavior such 
as lasing or evaporative cooling, whereas the stochastic wavefunction method 
is hardwired to a constant particle number N . 

How do these various representations compare? A check of their applicability 
to interacting Bose gas simulations with Hamiltonians ()2.17|) or ()2.12j) is shown in 
Table 15.11 One can see that the early distributions were often hindered in obtaining 
a many-body simulation by a whole variety of problems. 

For completeness, it should be mentioned that there are other distributions of the 
general form ()3.1|) that are more suited to non-stochastic calculations. For example 
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Table 5.1: Check list of required representation properties for the more commonly used phase- 
space representations, when applied to a lattice interacting Bose gas Hamiltonianof the forms (|2.17() 
or (|2.12l) . with no external coupling. 



distribution 

type 


positive 
real 


complete 
non- singular 


FPE'^ 


positive 
propagator 


stable 
unbiased 


UV 
convergent 


open 
systems 


Wigner 


no 


yes 


no 


varies^ 


- 


no 


yes 


Q 


yes 


yes 


yes 


no 


- 


no 


yes 


p 


yes 


no 


yes 


no 


- 


yes 


yes 


R 


no 


yes 


yes 


no 




yes 


yes 


positive P 


yes 


yes 


yes 


yes 


no 


yes 


yes 


sw.*^ Fock 


yes 


yes 


yes 


yes 


yes*^ 


yes 


no 


sw.^ coherent 


yes 


yes 


yes 


yes 


yes^ 


yes 


no 


gauge P 


yes 


yes 


yes 


yes 


yes'^ 


yes 


yes 



"i.e. only 1st and 2nd partial derivative terms in dP/dt. 

^There are no second order terms in the FPE for dynamics, while for thermodynamics the 
propagator may or may not be positive semi-definite, depending on occupation of modes. 
'^Stochastic wavefunction 

''Given an appropriate choice of gauge — see Chapter [3 



the symplectic tomography scheme of Mancini et a/jZZl EH], which expresses the 
quantum state as a probability distribution of a quadrature observable depending 
on a range of lab parameters. This has been used to investigate quantum entangle- 
ment and failure of local realism, but apparently has not led to many-mode quantum 
simulations, presumably due to the lack of a positive propagator in nonlinear evolu- 
tion. The complex P representation[ll allows one to derive exact results for certain 
problems, but does not lead to stochastic equations, since the distribution is neither 
real nor positive. 
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Chapter 6 

Boundary term errors and their 
removal 

As has been mentioned previously, when one attempts to simulate non-separable 
physical models using stochastic equations derived via phase-space distributions, 
instabilities in the equations are common. These can give rise to systematic biases 
in the observable estimates ()3.14|1 using a finite number of samples. Sometimes 
these biases abate at large sample number (Such a situation is considered in detail 
in Appendix^, while in other cases the biases remain even in the limit S ^ oo. 

The latter have been called boundary term errors because the root of the biasing 
lies in discarding nonzero boundary terms in integrations involving the distribution 
P{C) (and some factors) over C. Rigorous results regarding how and when these 
biases arise in practical simulations are few, and many researchers while aware of 
phase-space distribution methods, are either 1) unaware of the potential for biases, 
or 2) know that biases exist and because of this avoid using phase-space distribution 
methods altogether. With the aim of reducing this confusion, in Sections 16. Il and f6. 21 
the mechanisms by which these biases come about are investigated and classified for 
general phase-space distributions. To clarify the conditions under which boundary 
term errors may be a problem, some symptoms that can be checked for by inspection 
of the stochastic equations and observable estimators are pointed out. 

It will be seen in Chapters [7| and IHl that simulations of interacting Bose gases 
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can contain boundary term errors (at least when using coherent state kernels), and 
a method to overcome this problem is developed in Section Ifi.HI Subsequently, this 
un-biasing procedure is demonstrated on the two rather different examples from 
the literature where boundary term errors have been seen when using a positive P 
representation: Two-boson absorption (Section l(j.4|l . and a single-mode laser model 
(Section 16. 5j) . Removal of possible or confirmed biases in the interacting Bose gas is 
carried out in Sections 17.31 and respectively. 

6.1 Boundary term errors of the first kind 
6.1.1 Origin and general form 

During the derivation of the Fokker-Planck equation (from the original master equa- 
tion), there is a point between 1)3.411) and ()3.42|) where integration by parts is per- 
formed, and boundary terms discarded. The correspondence between full quantum 
evolution and FPE (and hence the final stochastic equations) occurs only if these 
discarded boundary terms are zero. Otherwise, there is a discrepancy between quan- 
tum mechanics and what is calculated by the stochastic equations in the limit of 
iS ^ oo. This has been termed a boundary term error. Let us see what exact form 
these discarded boundary terms take. 

The partial integration is made on a master equation that already contains all 
kernel gauge expressions ()4.19|) . ()4.12|) and/or ()4.2|) as well as the basic nonzero terms 
arising from directly applying the operator correspondences ()3.39|) to the master 
equation. In any case, all these terms can be incorporated into the formalism of 
With the real variable set C = {Cj}, when going from to (HO^ the 

discarded boundary terms are found to be (after some algebra) 



§I^\C) = P(C)T«(C)A(C) + Ip{C) Y.Ttf-^ - E ^ [nC)T^^,{C) 

k k 

(6.1b) 





where 
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The usual definite evaluation notation [/(f )]„ = lim^^b fiy) — limt,_^a f{v) has been 
used. Note that the kernel is still an operator, and hence so is the boundary term 
expression B. Every matrix element of the boundary term operator should be zero 
for the correspondence between quantum mechanics and the FPE to be exact. Such 
matrix elements could in principle be evaluated by inserting the elements of A and 
dA/dCk, which can usually be worked out exactly, into ()(i.l|l . This will be attempted 
for the gauge P representation in Section [6. 1.5|, and a toy example of the calculation 
of B is given in Section 16.2.21 

While fl6.1|) is nontrivial, there are generally a variety of ways in which it could 
turn out to be zero. 

1. Especially for an open phase space, one expects that P{C) will tend towards 
zero at the boundary, and if this rate is faster than the growth of the mul- 
tiplying factors {T^*^{C) times the matrix elements of A), then the terms in 
()6.1|) will be zero. 

2. Even if individually the terms in 1)6.11) do not tend to zero, but the distribution 
P{C) times multiplying factors is an even function of Cj, then the values 
evaluated at max[Cj] and min[Cj] may cancel each other. A typical example 
of this is when a particular Cj = 6 is the phase of a complex variable — any 
function of 6 evaluated at the limits of its domain [— tt, tt] is the same at both 
limits, and any boundary terms in 6 sum to zero. 

3. Also, there may be a cancellation between separate terms in the expression 

Conversely, the boundary terms can easily turn out to be nonzero in an open phase- 
space if -P(C) drops off too slowly as \Ck\ grows (typically — if the tails of P{C) 
obey a power law), or, in a partly or fully closed phase-space, when P{C) 7^ at 
the boundary. 
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6.1.2 Symptoms indicating boundary term errors 

Unfortunately, it is not generally possible to directly evaluate the boundary terms 
()6.1|) and check correctness, because this involves knowing at least the far tail be- 
havior (in an open phase space) of P{C) as a function of time. In a fully or partly 
closed phase-space one needs to know the exact behavior near the boundary. These 
require detailed knowledge about aspects of the full solution to the master equation 
— generally not achievable in non-trivial systems, although power law tails of P{C) 
can be considered a bad sign. 

What is really needed are some symptoms of likely boundary term errors that 
could be looked for in the stochastic equations before simulation. Since boundary 
term errors arise when the distribution falls off too slowly as the (possibly open) 
boundaries of phase space are approached, at least four (there may be more) broad 
classes of features come to mind that indicate the probable presence of boundary 
term errors of the first kind. 

1. The first such symptom, fairly well known, is the presence of so-called moving 
SINGULARITIES. In a set of deterministic equations, these manifest themselves 
as solutions (usually a set of measure zero) that diverge in a finite time. Previ- 
ous studies |M1 lUB] have shown that when moving singularities are present in 
the deterministic parts of stochastic equations, boundary term errors usually 
result. In particular, it was seen that 

• In a number of studied systems, moving singularities in equations derived 
using the positive P representation occur whenever systematic errors are 
seen in observables calculated with estimators polynomial in the system 
variables. 

• In these systems, when the moving singularities are removed with the aid 
of stochastic gauges (as explained in Section lU^ . the systematic errors 
are removed as well|66j. 

The presence of pathological behaviour when moving singularities are present 
can be grasped intuitively: Due to the stochasticity of the equations, and 
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the continuous nature of the noise, one expects that after small times, the 
distribution of trajectories (at least the far tails of it) will extend over all of 
phase space. In particular into those phase-space regions where the solutions 
that diverge in a finite time occur. While the divergent trajectories are usually 
a set of measure zero, an infinitesimal proportion of divergent trajectories 
may have a finite effect on the observable means, on the general basis that 
X oo does not necessarily equal zero. However, because the number of such 
trajectories is infinitesimal in comparison with the rest, this effect can never 
be estimated by a finite sample of them, hence the systematic errors. 

Moving singularities may arise either in a dynamic fashion (e.g. in an equation 
dv — v'^ dt, v{t) — v{0)/[l — tv{0)], and a divergence occurs at t — l/v(0)), 
or due to divergent drift terms for finite phase-space variables (e.g. dv — 
dt/{l — v) at V — 1, or dv — dt/{l — t) at t — 1). In the latter situation the 
divergent trajectories may even be a set of nonzero measure. 

2. Secondly, one should also be wary of NOISE divergences — i.e. instabili- 
ties in the noise matrices B. For some small set of trajectories the Wiener 
increments dW{t) will conspire to act approximately as a constant (AH^) fa 



fective drift term proportional to B, and if such "drift terms" give rise to what 
effectively are moving singularities, then systematic errors will also result by 
an analogous mechanism as for instabilities of the drift terms. 

3. Thirdly, Another indicator of possible boundary term errors is a DISCONTINU- 
OUS DRIFT and/or noise MATRIX B. This may cause an effect that can not 
be simulated stochastically, as the discontinuity is only ever sampled by a set 
of trajectories of measure zero. In particular, there may be a region near such 
a boundary that contains only an infinitesimal portion of total trajectories, 
but those trajectories have large impact on the observable estimates. 




over a time interval At ^ dt. This results in approximately an ef- 



4. Fourthly, if the INITIAL DISTRIBUTION IS TOO BROAD, boundary term errors 
may be present from the outset. An example would be a Lorentzian ini- 
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tial distribution of variable v{0) if the Fokker-Planck equation contains terms 
quadratic in v. Another example is the single-mode laser model of Section IfTHl 
with non-delta-function initial conditions. 

If any of the above symptoms is present, boundary term errors may be suspected. 
Conversely, if the symptoms are absent, it appears likely that no boundary term 
errors occur, but unfortunately rigorous results for general distributions are hard to 
come by. No boundary term errors were seen for any simulation that did not suffer 
from one or more of these symptoms. 

There have also been developed several numerical indicators (described in detail 
in jM] ) that one can use to check for the presence of boundary term errors from a 
finite set of trajectories. 

6.1.3 Moving singularities in a complex phase-space 

For a set of stochastic equations (e.g. such as ()3.45|) or ()4.90|) ) it is relatively easy to 
rule out boundary term symptoms 3 and 4 from the previous section (by inspection). 
Moving singularities and noise divergences (symptoms 1 and 2) are a little more 
slippery, but practical conditions for their absence can be stated. Simulations in 
this thesis are with complex variable sets on an open phase space, so here a practical 
condition for absence of moving singularities will be derived only for this particular 
(though commonly occurring) kind of phase-space. 

In an open complex phase space, barring symptoms 3 and 4, moving singularities 
cannot occur if the both the drift and noise matrices of the stochastic equations 
contain no radial components that lead to super-exponential growth. This is because 
then all variables remain finite at finite times, and so at finite times never reach the 
"boundary" at modulus infinity. Since z = ze^™ for any complex z^ tangential 
evolution does not lead to divergences, because boundary terms evaluated at the 
boundaries of the polar variable {/^z = or 2tx) are equal, and thus cancel in ()fj.la|l . 

More precisely, for a set of stochastic equations 

dzj = Aj{C) dt + J2 Bjk{C)dWk{t), (6.2) 

k 
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in complex variables Zj (C = {zj}, including, possibly zq), this gives the Condition: 
• Moving singularities or noise divergences will not occur provided that the limits 

A- B k 

lim — ^ and lim (6-3) 

converge for all Zj G C and all fc, and that symptoms 3 and 4 of Section [6.1.21 
are ruled out (by inspection). 

6.1.4 Generic presence in many-mode simulations 

Off-diagonal kernels appear necessary to allow all quantum states to be described 
by a positive real distribution that can be sampled stochastically (see Section l3. 2. 3|) . 
In such a case, the independent "ket" and "bra" parameters of the kernel will (if 
complex) almost always enter the kernel in an analytic way. Then, to ensure a 
positive propagator, requires in turn that the complex derivatives be chosen as in 
Section 13. 4. 3^ preserving the analytic nature of the formulation into the stochastic 
equations. 

The presence of moving singularities in (deterministic) dynamical equations for 
an analytically continued phase-space is a well-studied problem for both ordinary 
and partial differential nonlinear evolution equations. According to the Painleve 
conjecture (see, e.g. jT^l HOI EI]), this is equivalent to non-integrability - and is 
therefore generic to many (in fact, all but a set of measure zero) complex dynamical 
equations with large numbers of degrees of freedom. Just the sort one obtains when 
simulating many-body systems. 

Kernel gauges are, however, not restricted by the above mechanisms to be ana- 
lytic. This is because they arise as arbitrary (hence, possibly non-analytic) multipli- 
ers to operators on the kernel, rather than as a result of applying analytic operator 
identities ()3.39|) to it. Because of this freedom of gauge choice, it has been conjec- 
tured by P. D. DrummondjSn] that breaking the analyticity of the complex equations 
using stochastic gauges is a necessary (though not sufficient just by itself) condition 
to remove boundary term errors. This agrees with previous work|661 Q], and also 
with the gauges used to remove boundary term errors in this thesis. 
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Diffusion gauges, while possibly non-analytic, do not appear useful for removal of 
boundary term errors of the first kind because they appear only at the FPE-Langevin 
stage, after any nonzero boundary term operators have been already discarded. 



6.1.5 Boundary terms in the gauge P representation 

Let us consider the gauge P representation. The kernel ()5.4|) . expanded in the 
complete Fock number state basis for each subsystem is A = f2 (S)^ A^, with 



nn 



^ fyn an 

A. = 5^^|n),(n|,e-"^-^N (6.4) 
— • 'n\n\ 



as in A master equation leads via ()5.7j) . without invoking gauges, to terms 

T^^ and T^^Jl polynomial (of low order) in and [3k. Use of a weighting gauge will 
lead to some terms T*^*) oc Vt. Drift gauges could generally lead to any form of the 
corresponding coefficients T^'\ but let us assume that the drift gauges Qk, etc. are 
polynomial in a^, [3k, a^, [31, and autonomous (i.e. not dependent on VL). Under 
this assumption, the terms T^*"* due to drift gauges will be polynomial in all the 
variables, and at most oc Vt^ (Diffusion coefficient in Vt due to standard drift gauge 
introduction is oc Vl?Ql). 

Combining this together, the (matrix elements of the) multipliers of P{C) ap- 
pearing in the boundary term expression ()6.1|) will at most scale as : 



< oc JJa^^/^^^e-^^^S (6.5) 

k 

where the coefficients take on all values ctfc > and 6^ > for various matrix 
elements. 

In an open phase space of complex variables Zj (as here), the boundary lies at 
\zj\ ^ oo. The multipliers of P(C) in ()6.1|) scale in these far tails as 

< oc JJ|afc|"'=|/3fe|^'=e"^l"^-ll^''l, (6.6) 

k 

where = — e*^"*e*^^*=, can have positive real part. This implies that if one wants 
to be sure that no boundary term errors are present, (assuming any drift gauges Qk 
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are polynomial in ak, Pk, or their conjugates, and independent of fl), then: P{C) 
should decay in the far tails no slower than 

P{C) < oc (6.7a) 
P(C) < oc exp(-Cfc|a,.r), (6.7b) 
P{C) < oc exp(-Cfc|Ar'=), (6.7c) 

where bk > 0, Ck > and > 1. (And there should be no time- dependent discon- 
tinuities in the drift or noise equations). 

This does not preclude that by some canceling of terms, a less favorable scaling 
of the tails may also become acceptable, but certainly if P scales like ()6.7|) or better 
in the far tails, boundary term errors (of the first kind) will be absent in a gauge P 
representation simulation. Note in particular that from ()6.7|) : 

1. Power law tails of P{C) in the a, f3 part of phase-space will invariably lead to 
boundary term errors. For some matrix elements of B for which the coefficient 
of (n|^ in ()6.4j) is of high enough order, the boundary terms will become 
nonzero. This is consistent with the conclusions drawn by Gilchrist et al |64j 
that power law tails in a positive P distribution are an indicator of boundary 
term errors. 



2. Exponential decay of P{C) with la^l or \j3k\ is also not good enough in general. 
This is because the coefficient of (say) la^l in the exponents of ()6.7j) is not 
constant, but oc \(3k\ — which may be any arbitrarily large value. Only a 
faster-than-exponential decay will be able to overcome this when \l3k\ is large. 
However, Gilchrist et a/^^ found no boundary term errors in a single-mode 
anharmonic oscillator ()6.9|1 simulation for which the far tail behaviour of P+ 
appeared to be exponential, and no indications of boundary term errors in this 
system are seen here in Chapter[7|either. This may be because the single-mode 
system is a special case — certainly in Chapter |H1 it is seen that when acting 
on several coupled modes, a similar Hamiltonian can lead to boundary term 
errors. 
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6.2 Boundary term errors of the second kind 

Biases that do not abate for 5 ^ oo may also arise via a different mechanism when 
the estimator functions to be averaged to obtain the expectation value of a given 
observable grow too rapidly in phase space. 



6.2.1 Mechanism and overview 

Working at the level of the distribution -P(C), expectation values of observables 
are given by the real part of (jH.lHj) . When one takes a number S of samples of 
this distribution, the estimator ()3.14|) is used, but if the real parts of the integrals 
in ()3.13|) do not converge then this finite sample estimate does not approach the 
quantum mechanical value even in the limit S ^ oo. Systematic errors arising from 
this source will be called here boundary term errors of the second kind, and have 
very different properties to those of the first kind. Explicitly, from ()3.13|) and by 
imposing Hermiticity, the integrals that must converge are 

j PiC) RejTr O (a(C) + At(C*)) } dC (6.8a) 
j P{C) RejTr A(C) } dC, (6.8b) 

for any particular observable O. 

One can see that even when (j6.8bj) is convergent, errors may occur for some ob- 
servables for which only ()6.8ap diverges. Given a particular distribution, divergences 
may come about due to the matrix elements of either the kernel A, the operator O, 
or both growing too rapidly as the boundaries of phase space are approached. Ob- 
servables that involve an infinite sum of moments polynomial in the system variables 
(such as the parity in the gauge P representation, whose estimate is given by the 
sum fl5.15|l ) may be particularly prone to these boundary term errors, because the 



dependence of Tr 



OA 



on the phase-space variables will grow particularly rapidly. 
Boundary term errors of the second kind depend on the equations of motion 
only by their effect on the distribution of samples P{C), and so do not show up as 
moving singularities or noise divergences in the equations. 
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A sobering point is that, in theory, one could always come up with some ob- 
servables that grow fast enough as the boundary of phase space is approached to 
overcome whatever far tail behaviour occurs in A(C)P{C). Such an observable could 
be defined as an infinite sum of polynomial moments whose terms grow appropri- 
ately rapidly. Thus, boundary term errors of the second kind are likely to occur for 
some observable estimates for any phase-space distribution simulation method, for 
any model. In practice, fortunately, it suffices to make the estimators of only all 
observables of interest convergent. 

6.2.2 Example: un-normalized positive P representation 

A good example of boundary term errors of the second kind occurs for the "anhar- 
monic" oscillator single-mode Hamiltonian, when using an un-normalized coherent 
state kernel. This Hamiltonian^ is 

H = ha^^a^, (6.9) 

the single-mode analogue of a repulsive two-body potential in the interaction pic- 
ture^. Let us consider the kernel 

A = (6.10) 

off-diagonal in the un-normalized Bargmann coherent states flS.lj) . It is convenient 
to change to logarithmic variables = log(a/3), and rriL = log{a/l3), then the basic 
operator correspondences are, from ()5.6|) . 

aA= aA= e^'^^+'^^^/^A (6.11a) 

d_ _d_ 
^driL drriL 

Aa= ^A= e("^^-"^)/2 - A. (6.11d) 

\dnL dmLj ^ ' 

It is also convenient to split the variables into real and imaginary parts: = n'^ + 
in'[, and = + im'[. For a dynamical evolution model with no coupling to the 
^In some appropriate time units. 

^That is, with terms linear in a and transformed away. Compare with H2.17|l . 



atA= £A= e-(-+"^)/M 7^ + 7^ ) a (6.11b) 
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environment, utilizing the analytic nature of the kernel as explained in Section 1^3131 
one obtains the FPE (provided no boundary term errors of the first kind are present) 



dP ( d & 



dt 1 dm'[ ' dn'i^^ dn'[^ dm'j^'^ dm'P 

Q2 Q2 Q1 Q2 



dn'^dm'[ dm'[dn'^ dn'^dm'^ dm'^dn'[ 



P. (6.12) 



This gives the Ito stochastic equations (with standard square root form of Bq = \/13) 



dnL= iV2i[dW{t) -idW{t)] = 2d7]{t) (6.13a) 
dmL = iV2i [dW{t) + idW{t)] -2idt = 2i[dr]*{t) - dt]. (6.13b) 

dW{t) and dW{t) are the usual independent Wiener increments, and the complex 
noise dri{t) defined here obeys 

(^^(i)).toch = (6.14a) 

W(t)rfr^(t')),toch = S{t-t')dt' (6.14b) 

{dr,{t)dv{t')),,,,^ = 0. (6.14c) 

The FPE (or stochastic equations) can be solved, and in the case of coherent state 
initial conditions (with initial amplitude ao) 

P{nL, rriL, 0) = 6\mL - 2tZao) S^ul - 2 log |«o|), (6.15) 

the distribution at time t is 

P{nL,mL,t) = S^(^mL— i{n*];^ — 2 log \ao\) — 2it + 2iZao 



1 f |ni-21og|ao| 



|2 



In this toy model, one can actually check if there really were no boundary terms 
of the first kind, by substituting ()6.16|) . and ()6.10|) into the expression for E. The 
boundary term operator B is given by the integrals ()6.1|) of operators B^^^ composed 
of P, A and coefficients T^*\ with individual terms of B evaluated at the boundaries 
of phase space. In our case these boundaries are Cj — oo and Cj oo for all 
j = 1, . . . , 4, and one requires all the matrix elements of B to be zero. By comparison 
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of ()6.12|) to ()3.43|1 . and labeling variables as = Ci, n'[ = C2, m'j^ = C3, and 
m'l = and also setting the term counter / = 1, the nonzero T^'^ coefficients in 
the Bf' are T^^ = -2, T^;]] = 2 for all j, and = T^i = T^^l = T-fsl = -2- Also, 
the matrix elements of A (in a Fock basis |n)) can be written (from the definition 
dSH} of ||a)) as 

(6.17) 
So: 

1. The T^*^ coefficients are constants. 

2. The matrix elements of A and dA/dz are exponential in both variables z = ni 
or rriL, with some constant positive or negative coefficient. 

3. The distribution P is Gaussian in real and imaginary parts of both variables 
{riL around mean 21og|ao|), mi around mean 2i{Zao — t)). 

One can see that all matrix elements of all the integrands ((n| \n)) will be dom- 
inated by the Gaussian decay of P{C) in the far tails for all n and n. For j = 1, 2, 
after integration over mi and a (real or imaginary) component of ni, the expression 
to be evaluated at large ni will be Gaussian, hence zero in the far tail limit. Also, 
for the terms (j = 3, 4) in which one integrates over and a component of rriL, the 
expression to be evaluated will be a Dirac delta function — also zero in the far tail 
limit. Hence, 

B = 0. (6.18) 

Thus it has been con&med that there are no boundary term errors of the ffist kind. 

So far it seems like this might be a very good kernel to try even for many-body 
locally interacting Bose gases, which have the same kind of interparticle potential 
term. The advantages would be that there are no boundary term errors of the ffist 
kind, no moving singularities (compare to Section l6.1.4|) and even no nonlinear terms 
in the stochastic equations (compare to ()5.17|) ). 



{n\ A \n) 



A /t) It) 1 



exp 



ni(n + n)/2 + rriL^n — n)/2 
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Having the luxury of knowing P{C) for this toy problem, let us see what happens 
when one tries to calculate an observable. Directly from ()6.1()j) and ()5.3|1 : 



Tr 



A 



exp(a/3) = exp [e 



(6.19) 



The essential denominator term ()6.8b|) in any observable calculation, is then (using 
()6.16p and defining rii = n'^ + ^^l); 



Reiexp [e"^]} e 

^ Ant 



|ni-21og|ao||2/4t^2^^ 



OC 



exp 



cos n'[ 



cos (e< sin<) g-K-2iog|ao|)V4ie-K)V4t ^2^^ (6.20) 



For riL with large positive real part, the factor from Tr A dominates (it's an expo- 
nential of an exponential!), and since the integrand is symmetric in n'[ and nonzero, 
it must become unbounded, and the integral diverge. 



A plot of the integrand PTr 



A 



for several time values is shown in Figure IFTTl 
The unbounded behaviour at high n'^ values indicates that ()6.8b|) will diverge and 
boundary term errors (of the second kind) will be present. A comparison to the 
probability distribution of is also made there to indicate which contributions to 
the integral ()6.8bjl are sampled by a stochastic simulation. 

This means that: All observable estimates using fl3.14|) are suspect with this 
representation! 



6.2.3 In the gauge P representation 



A 



For the gauge P representation, Tr 
becomes simply 

^P(C) e^o cos z'f^dC 



, and so the denominator integral (j6.8bjl 



(6.21) 



in terms of zq = Zq + iz^. This will be convergent provided the far tails of the 
marginal probability distribution of z'q decay as e~^o, or faster in the limit I^qI 00. 

From the estimator expression ()5.1Hj) for an arbitrary observable that is a poly- 
nomial function of a and a\ one can see that the numerator integral ()6.8ap will be 
of the form 



(6.22) 
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Figure 6.1: Integrands appearing in the denominator of observable estimates for anhar- 
monic oscillator dynamics governed by the Hamiltonian H().9|) . Values plotted are (Solid line): 



/ Tr A(C) PiC)(fmL, evaluated (using and ^J^) at 



0. Coherent state initial 



conditions (|6.15(l were used with mean occupation number unity (|q;o| — !)• To show what parts 



of the integral (|6.8b|l would be sampled by a stochastic simulation, the probability distribution of 
TiL evaluated at = (i.e. J P{C)cfmL evaluated at n'l — 0) is also plotted as the Dashed 
LINE. Note the unbounded behaviour of the integrand at large positive values (which are not 
sampled) . 



where the functions for each term fj are 



/.(C) = 




with some non- negative integer coefficients and bk, which take on different values 
for different fj. The behaviour of each such term as one approaches the boundaries of 
phase space, where the modulus of at least one variable tends to +oo, is exponential 
in Zq (with unity coefficient), and polynomial in \ak\ and \(3k\- 
This means that 

• Boundary term errors can be expected for observables polynomial in and/or 
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al if the tails of P{C) decay as any power law in the \ak\, or \Pk\, or as e ^'o 
or slower in Zq. 

Conversely, if the tails of P{C) decay faster than power law in all \ak\ and and 
faster than e~^'o in the modulus of the weight, then boundary term errors (at least 
of the second kind) will not occur for any observable polynomial in the annihilation 
and creation operators on subsystems. That is, any observable given by ()5.10|) . or a 
finite linear combination of such expressions. 

Power law tails of P{C) in \ak\ or \i3k\ do not immediately exclude all observables, 
only those whose estimators grow too rapidly. By inspection of ()5.1()|1 . ()5.1Hj) . one 
can see that an observable that is an a^th polynomial of will lead to a term 
fj oc e^o|aA;|"'=. If P{C) oc e~^^o|afc|~"'= in the far tails, then the indefinite integral 
/ P{C)fj{C)dC will be oc e(i-^)<)|afc|"'=-"'=+i (if 0^-0^ + 1 ^ 0), or oc e^^-'^^^o log \ak\ 
(otherwise) in this far tail region. When this is evaluated in the far tail limit, 
the integral will convergent if the decay of P{C) is fast enough. Similarly for the 
relationship between a|. and jSk- Hence, one can state the conditions for expectation 
value estimates of a particular operator A to not suffer from boundary term errors 
of the second kind: When ej are infinitesimal positive real constants, one requires 

A oc 0?'= ^ lim PiC) < oc - — , }^ ^ , (6.24a) 
A oc al^'^ =^ lim P{C) < oc , ^ , ^ , (6.24b) 



as well as 



lim P{C) < oc e~(^+'^)"o. (6.24c) 



Non-polynomial observables defined as infinite sums (e.g. the parity ()5.14|) ) can 



AO 



etc. that are to be sampled 



be more troublesome, because the functions Tr 
by ()6.8aj) can grow faster than polynomially as |Q;|jt, 00. Conditions of the 

kind ()6.24|) can also be derived, but will not scale as a power law. For example since 
the parity estimator ()5.15p has a numerator expression that scales oc e"^^''''^"*'^*^, 
one requires P{C) to decay faster than e'^^^'^°'^^^^ in the far tails for an unbiased 
estimate of parity expectation values (remember that ak(3k may have negative real 
part in highly non-classical parts of phase-space). 
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6.2.4 Locally-interacting Bose gas calculations 

Here the likelihood of boundary term errors of the second kind is considered for 
a many-mode interacting Bose gas model using the gauge P representation as de- 
scribed in Sections 15.31 and I5.61 By inspection of the dynamics and thermodynamics 
gauge P equations for the locally interacting Bose gas ()5.17j) - ()5.2H) . and one 
sees that the stochastic terms (assuming no gauges) take the forms 

da^ ■■■+ candWn (6.25a) 
dpn ■■■+ cpndWn, (6.25b) 

with some complex constants c and c, apart from possibly additional terms caused 
by finite temperature heat bath interactions (j5.18j) 

dan \- \/lnnhs.thiT)dr]n (6.26a) 

dPn h V7nnbath(T)(i?7*. (6.26b) 

In this Subsection, the far-tail regime of phase space where |an| and/or |/3n| are large 
will be of interest, and there the terms ()6.26|) are negligible, and hence are ignored 
here. 

Upon change of variables to = logOn and = log/5*, one finds by the rules 
of Ito calculus that these logarithmic variables will have stochastic evolution 

dzn ■■■+ cdWn (6.27a) 
dzn ■■■+ cdW^. (6.27b) 

This indicates that the short time evolution of z^ and in is dominated by Gaussian 
noise. Thus at short times, (defining z^ = z'^ + iz'^, and z^ = z'^ + iz'^ 

Changing back to the coherent state amplitude variables. 
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The far tail decay as a Gaussian of a logarithm of |a;n| or |/?n| is strong enough to 
overcome any polynomial of these variables if one moves far enough into the tails, 
(polynomials will be only exponential in the logarithm). So, there is no immediate 
reason to suspect boundary term errors of the second kind for polynomial observables 
here. (They cannot be ruled out by the above discussion, however, because at longer 
times the evolution of -P(C) is much more complicated and no proof for such a regime 
has been given). 

On the other hand, ()6.29p does not decay strongly enough to make an average 
over a parity estimator converge, since these latter can grow as fast as exp(2e^""^^") 
(when anPn is negative real), which grows much faster than 1)6. 2 9j) decays. One 
thus suspects that parity expectation values will not be estimated by ()5.15p in an 
unbiased way for a gauge P representation of the locally interacting Bose gas model. 

6.2.5 Unresolved issues 

Lest one think that boundary term errors are fully understood, a variety of issues 
are still unclear (to the best of the author's knowledge). Some of these have to do 
with tradeoffs between the two kinds of error. For example: 

• The role of normalization: In Section [6.2.21 it was seen that for a (single- 
mode) interacting Bose gas, the un-normalized positive P representation did 
not suffer boundary term errors of the first kind, but had severe problems 
with errors of the second kind when Re {a(3} — >■ +oo. On the other hand, the 
normalized positive P representation or the gauge P do not suffer from the 
second kind of error for most observables, but (as will be seen in Chapter ^ 
does have problems with moving singularities, and hence presumably the first 
kind of boundary term error, when Re {ajS} — oo. 

There appears to be a tradeoff between the two processes causing boundary 
term errors, which can involve the normalization of the kernel. Nevertheless, 
the normalization does not parameterize the tradeoff in a straightforward way, 
since the two kinds of boundary term error occur in different regions of phase- 
space. 
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• The role of moving singularities: Only the first kind of boundary term 
error appears during derivation of the FPE and stochastic equations while on 
the other hand, errors of the second kind can occur simply on a static initial 
distribution, which does not evolve in any way. For these reasons it appears 
that moving singularities in the deterministic part of the stochastic equations 
should be related to boundary term errors of the first kind. 

On the other hand, moving singularities do lead to pathological tail behaviour 
in P{C) after a certain time tsing (even without any noise terms, if the initial 
distribution is nonzero on any point of a trajectory that escapes to infinity at 
finite time.) Because the far tail behaviour of P{C) changes in this qualitative 
way after tsing, one should be able (at least in some cases) to find some ob- 
servables that would be estimated in an unbiased way for t < tsmg, but suffer 
from boundary term errors of the second kind for t > tsing- So it appears that 
moving singularities can also influence the second kind of boundary term error 
in some way, presumably by causing slowly decaying distribution tails. 

A better understanding of the relationship between the two kinds of error might 
lead to a way to make an advantageous tradeoff that avoids both, at least in some 
cases. 



6.3 Removal of the errors using kernel gauges 
6.3.1 Conceptual justification 

If one introduces a kernel gauge (e.g. a drift gauge Qk) using a gauge identity 



J 



A 



: 0, deflned by ()4.1|) as in Section 14.11 then zero is added to the master 
equation in the form J P{C)J-'{C)J A dC, as in ()4.2j) . Introducing such a kernel 
gauge can have an effect on the presence of boundary term errors in two ways: 

1. Modification of the distribution: A change in the FPE due to addition of 
extra gauge terms will cause a direct change to its solution P{C,t). A good 
choice of gauge can lead to faster decaying tails, and hence directly to i3 = 
from evaluation of expression ()6.1|) . and no boundary term errors of the flrst 
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kind. This is the easiest approach in practice, and will be considered in some 
detail in the next Subsection 16.3. 21 

Less directly, this approach also has the potential to remove boundary term 
errors of the second kind. This is again because P{C) changes, and if the far 
tail behaviour is made to decay faster, the observable expressions (I6.8a|l and 
()6.8b|l may become convergent. 

2. Renormalization of boundary terms: The expression fl6.1bjl gains extra 
gauge terms, which can have an effect of the final value of B. During the 
derivation of the FPE in Section 13.4.11 these extra added terms to the master 
equation can be written in the T^'^ notation of 1)3.411) and also (jfi.lj) for B as 

To(°) = jrj(o) (g_30a) 
Tl^f = J^jf^ (6.30b) 
^o2 = ^Jfl (6-30c) 



One finds that the additional boundary terms introduced by this kernel gauge 
are (using (j^.lj) ) are of the same general form as ()6.1aj) . but with integration 
over some additional terms B^j'j on top of the usual B^^- Using ()6.1b|) . these 
new terms are 

Bf,{C) = (6.31) 
HC) |P(C) J«(C) + ip(C) E^?^ -l^^ {PiC)Tl^\C)) I A(C). 

I k ^ k ^ ) 

The point to be made is that while the identity A , which can be expressed 
in differential form as ()4.1|) . is zero, the integrands Bj^j for the boundary term 
expression B have a different form, and are not necessarily zero at all. This 
then may open the possibility that if originally (with no gauges), the boundary 
terms were nonzero, an appropriate choice of gauge can cancel these, again 
leading to i3 = 0, and no boundary term errors of the first kind. 
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6.3.2 Practical implementation using drift gauges 

Let us take the direct approach of changing P{C, t) to make the boundary terms 
B = 0. A difficulty is that the solution of P{C,t) is not known for non-trivial 
cases, so one cannot work with it directly. However, as pointed out in Section [6. 1.2[ 
moving singularities in the noiseless equations are very closely tied to boundary 
term errors of the ffist kind. Certainly in the sense that their presence indicates 
weakly decaying distribution tails, and hence boundary term errors. The drift part 
of stochastic equations can be modified practically at will using drift kernel gauges 
Qk (see Section r4.3|l . and in particular, the moving singularities can be removed in 
this way. As will be shown in examples of Sections 16.41 and 16.51 removal of these 
moving singularities does indeed appear to result in disappearance of the boundary 
term errors. 

Given a kernel amenable to drift gauges (i.e. possessing a global weight Q), and 
keeping in mind the symptoms of boundary term errors of the ffist kind outlined in 
Section 16.1.21 one wishes to choose drift gauges according to the following heuristic 
criteria: 

1. Moving singularities that were present in the original (ungauged) stochastic 
equations are removed. This can be assessed by analysis of the deterministic 
parts of the stochastic equations. See examples in subsequent sections. 

2. No new moving singularities are introduced. 

3. No noise divergences are introduced (in the dfl = ■ ■ ■ + Q J2k GkdWk evolution). 

4. No discontinuities in the drift or diffusion of any variables are introduced. 

5. Given the above criteria 1-4 are satisfied (which can be assessed by inspection 
or analysis of the gauged stochastic equations), one wants to minimize the 
weight spread in the resulting simulation, to optimize efficiency. 

A broadly applicable rule of thumb is to keep the gauges as small as 
possible while achieving the goal of removing boundary term errors. 

This can be aimed for using several complementary approaches: 
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(a) Make the gauge functions zero in regions of phase space where no correc- 
tion to the un-gauged drift is necessary. 

(b) Minimize a relevant variance of the weights. This weight spread has 
been considered in detail in Section 14.3.41 Depending on the situation, 
the quantity X]fc l^fcP (more generally) or J2k^^ (^°^ short time 
calculations of some observables) can be minimized, since the short-time 
growth of the variance of or Re {Q} (respectively) has been shown in 
Section 14.3.41 to be proportional to these quantities. 

(c) Ensure that drift gauges are zero at deterministic attractors in phase- 
space, so that no weight spread is accumulated when no significant evo- 
lution is taking place. 

(d) Minimize the variances of estimators of observables of interest. That is. 



the variance of quantities Tr 



OA 



appearing in the numer- 



and Tr 

ator of observable estimates ()3.14|1 . In this manner, one can optimize the 
simulation for the estimation of a particular observable O, rather than 
more generally as in point (a) above. 

6. One should also avoid gauges with the following features: 

(a) Gauges that are nonzero only in a rarely visited region of phase space. 
The problem is that good sampling of the gauged region is needed to make 
the correct change to observable estimates — otherwise, if the gauged re- 
gion is not visited by any trajectories, one has practically the same sim- 
ulation as without gauges! In a less extreme case where only a tiny (but 
nonzero) fraction of trajectories visit the gauged region, the correction 
from biased-ungauged to unbiased-gauged observable estimates is badly 
resolved, and the simulation results become very noisy. Furthermore, if 
the bias to be corrected is significant, then the change in the few gauged 
trajectories needs to be large to compensate for the bias, and hence the 
weight correction on those trajectories will become large as well. This 
leads to a large weight spread. It is much better to make small changes 
in many trajectories contribute to removal of the bias. 
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(b) Gauges rapidly changing in phase-space. These typically lead to stiff 
equations, and a gauged region of phase space that is badly sampled by 
reasonable-sized ensembles. This bad sampling arises when no single tra- 
jectory stays in the gauged region long, to such a degree that at a given 
time few (or, on average, possibly less than one) trajectories are sam- 
pling the gauged region, i.e. sampling the resulting observable estimate 
corrections. This results in similar problems as in the previous point (a). 

(c) Non autonomous gauges (i.e. depending directly on the weight Q). The 
problem with these is that they are usually much more difficult to analyze 
to check whether or not new moving singularities or noise divergences 
have been introduced. 

(d) Gauges analytic in complex variables if the kernel is also analytic in these. 
As discussed in Section 16.1.41 it is desirable to break the analyticity of 
the stochastic equations to avoid moving singularities. 

Note that an explicit variational minimization according to 5(b) or 5(d) could 
be in conflict with the recomemndations 6(a) or 6(b), in which case it is prob- 
ably best to try the simulations and see which works better. 

This technique appears to be broadly applicable, and only requires the recog- 
nition of what instabilities in the stochastic equations could lead to problems. It 
does not require detailed knowledge of what the boundary terms B are, nor of the 
behaviour of the far tails of P{C), provided instabilities are removed. 

6.3.3 Ito vs. Stratonovich moving singularities 

There is a subtle ambiguity that can arise when using the method just presented: 
Should one remove moving singularities from the Ito or the Stratonovich form of the 
stochastic equations? 

As indicated in Appendix El when one numerically integrates the stochastic 
differential equations using a finite but small time step At, the exact form of the 
drift that is integrated can depend on details of the integration algorithm chosen. For 
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example, if one evaluates the derivatives using the variable values at the beginning 
of a time step (i.e. simply those calculated using the previous step), then one 
integrates the standard Ito form of the equations. However, with a semi-implicit 
method, where one first estimates the variables in the middle of the time step, 
before using these to calculate the derivative and actually advance the simulation, 
one must integrate the Stratonovich form of the stochastic differential equations. 
The difference between these is given by the Stratonovich correction to the drift 
()B.7j) . In practice, multiplicative^ stochastic equations, such as those considered in 
this thesis, are more stable (and hence, efficient) using the semi-implicit method |82j. 

In some situations it may occur that moving singularities occur in the determin- 
istic parts of the Stratonovich equations but not the Ito equations (or vice- versa). 
To arrive at practical guidelines on this issue, one can argue in the following way: 

• Since both the explicit (Ito) and semi-implicit (Stratonovich) algorithm sim- 
ulate the same underlying equations, then to be sure that no bias is being 
introduced, moving singularities should not occur in either algorithm. In fact, 
a whole family of hybrid Ito- Stratonovich algorithms are possible where deriva- 
tives are evaluated at arbitrary time values during the time step At, and the 
difference in the drift terms between two such algorithms is a multiple G [0, 2] 
of the Stratonovich correction ()B.7|) — See Appendix El None of these algo- 
rithms should contain moving singularities, otherwise some kind of bias may 
be suspected. 

• The Stratonovich correction for variable Cj (1B.7jl is highly dependent on the 
form of the noise matrix elements (oc '^f^^Bi^idBji/dCk)- If one finds that 
moving singularities are present for one algorithm but not for another, this in- 
dicates that the Stratonovich correction is growing very rapidly as one heads 
for the boundaries of phase space. This in turn indicates that the noise ma- 
trix elements must be growing in some appropriately rapid fashion as well — 
possibly rapidly enough to cause noise divergences. 

^i.e. equations in which the noise terms are proportional to the variables, and hence of non- 
constant variance. 
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Taking the above points into consideration, it is conjectured here that: 

1. In a simulation that does not suffer from boundary term errors, there must be 
no moving singularities in the deterministic part of either the Ito, Stratonovich, 
or any of the hybrid Ito-Stratonovich forms of the stochastic equations. 

2. If a simulation has moving singularities for one of the family of algorithms (e.g. 
Ito), but not for some other member of this family, then noise divergences may 
be suspected as an additional cause of boundary term errors. 

3. To eliminate biases caused by moving singularities using a drift gauge, one 
must remove moving singularities from all hybrid Ito-Stratonovich algorithms 
with the same gauge choice. 

6.4 Removal example 1: Two-particle absorber 

Two examples will now be worked through — those systems in which boundary term 
errors in positive P simulations have been reported in published work. Removal of 
boundary term errors using the guidelines of Section l(i.3.2l will be demonstrated, and 
the procedure considered in some detail. The first example will be a single-mode 
two-boson absorber (in this section), while the second example of a single-mode laser 
will be treated in Section lfj.5l These two examples demonstrate somewhat different 
aspects of boundary term errors and their removal. 

This section is closely based on Section IV of the published article by Deuar and 
Drummond ^Hj ■ 

6.4.1 The single-mode model 

Consider a single optical cavity mode driven by coherent radiation, and damped by a 
zero temperature bath that causes both one- and two-photon losses. For convenience, 
units of time are chosen so that the rate of two-photon loss is unity. The scaled one- 
photon loss rate is 7, and e is the scaled (complex) driving field amplitude. Following 
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standard techniques, the system can be described with the master equation 

— = [ea^ — e*a, p] + — {2apa^ — a)cLp — pa^a) 

+ ^{2a^pa^^ - a^^a^p-pa^^a^) . (6.32) 

This is also the master equation of two-particle absorption in an interaction picture 
where kinetic processes have has been relegated to the Heisenberg picture evolution 
of the operators. 

Let us use the gauge P representation to convert this master equation into a set 
of stochastic equations. The single-mode kernel is written as 

A{C = {a,(3,Q} ) = fi||a)(/?*||e-°^. (6.33) 

Following the treatment of Sections 13. 4| 14. 3| and 15. 3^ and converting to Stratonovich 
form using ()B.7|) . one arrives at the Stratonovich stochastic equations 

da = [e - a{af3 + ig + {-^ ~l}/2)]dt + iadW (6.34a) 
df] = [e* -(3{a(3 + ig + {-r-l}/2)]dt + i(3dW (6.34b) 

(6.34c) 

Here S^dt is the appropriate Stratonovich correction term given by ()B.7|) . which 
depends on the functional form of the particular gauges chosen. 

With no gauge {Q = Q = 0), the positive P Stratonovich equations are recovered: 



dQ 



Sndt + Q 



gdw + gdw 



da = [e -a{ap + {j-l}/2)]dt + iadW (6.35a) 
d(3 = [e* -(3{a(3 + {-f-l}/2)]dt + i(3dW . (6.35b) 

This two-particle loss model can give either correct or incorrect results when 
treated with the usual positive P representation methods. Generally, problems only 
arise when any accompanying single-particle losses have small rates or when the 
number of bosons per mode is small (see Figure 16. 3j) . It is a well-studied case, 
and a detailed treatment of the boundary term errors that appear can be found in 
It also has the merit that exact solutions can be readily found using other 
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means. Analyzing this example gives a good check that the moving singularity 
removal procedure of Section lfi.3.21 does in fact eliminate boundary terms and leads 
to correct results. Several simplifications of this model will now be concentrated on, 
which correspond to existing literature. 

6.4.2 Two-boson absorber 

In the simplest form of this model, corresponding to 7 = e = 0, only two-boson 
absorption takes place. One expects that for some state lip) = J2'^=o^n\n) in a Fock 
number state basis, all even boson number components will decay to vacuum, and 
all odd-numbered components will decay to |1), leaving a mixture of vacuum and 
one-boson states at long times. 

The positive P representation has been found to give erroneous results jZOllHnilHll 
IHSI accompanied by moving singularities [M] ■ There are power-law tails in the distri- 
bution that lead to boundary term errors — consistently with the conditions ()6.7|) . 
The observable usually concentrated on in this system is the occupation number 
n = a^a, which is estimated by n = (a/5 = 'n-)stoch positive P representation. 

The variable n = a(5 has a convenient closed equation (Stratonovich), 

dn = -2n{n + ig^ -1/2) dt + in dW^ (6.36) 

with dW+ = {dW + dW), and 0+ = {G + g)/2. 

Let us examine the behavior of the above equation when = 0, i.e., in the 
standard, un-gauged formulation. The deterministic part of the evolution has a 
repellor at ri = 0, and an attr actor at h = 1/2. The noise is real, finite, and of 
standard deviation dt/2 at the attractor. The deterministic part of the evolution 
has a single trajectory of measure zero that can escape to infinity along the negative 
real axis, 

«(t) = -m = ^ _ , (6.37) 

Y 2{tsing t) 

where tsing = l/2a;(0)^ = — l/2n(0). This moving singularity is a symptom of likely 
boundary term errors. 

Indeed, in the steady-state limit, all trajectories in a simulation will head toward 
h = 1/2, with some remnant noise around this value being kept up by the stochastic 
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term. This noise does not affect the average, however, and the estimator of (n) 
becomes hmf^oo?^ = linit^oo (^)stoch ~ 1/2 in the many trajectories hmit. Quantum 
mechanics, however, predicts that if we start from a state p(0), the steady state will 
be 

oo 

hm (n) = 5^ (1 + 2j| p(0) |1 + 2j) , (6.38) 

t— >oo ' 
j=0 

since all even occupations decay to vacuum, and all odd to |1). For a coherent state 
\ao) input, say, this will be 

lim(n) = - fl - e-^l^ol') . (6.39) 

Thus one expects that the positive P simulation will give correct results only when 
el"«l' > 1. 

To correct the problem one has to change the phase-space topology in some way 
to prevent the occurrence of such moving singularities. It was found that a good 
gauge for a two-boson absorber nonlinearity in general is 

g = g = g+ = i{n-\n\) . (6.40) 

This will be dubbed the "circular" gauge due to the creation of an attractor at 
|n| = 1/2. 

This gauge has the effect of replacing the — 2n^ term in ()6.36|1 with — 2rl|n|, which 
is always a restoring force, and so never leads to any escape to infinity in finite time. 



With the gauge ()6.40p . the Stratonovich equations become 

dn = n{l-2\n\)dt + indW+ , (6.41a) 
da = -a{\n\ - 1/2) dt + iadW , (6.41b) 
dn = n { [n + {n - \n\)'^] dt + i{n - \n\) dW^} . (6.41c) 

How do these new equations measure up to the four symptoms of boundary term 
errors of the first kind outlined in Section 16.1.21 .'' In logarithmic variables ai = log a, 
riL = logh, and zq = logQ, the Stratonovich system equations are 

driL = {I - 2\n\) dt + idW^ (6.42a) 
daL = -{\n\- 1/2) dt + idW (6.42b) 
dzo = [h + {h - \h\)^] dt + i{h - \h\) dW+ . (6.42c) 
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One now has an attr actor on the circle \n\ = 1/2, and a repellor at n = 0, with 
free phase diffusion of n in the tangential direction. Once trajectories reach the 
attractor, only phase diffusion occurs. So: 

1. Moving singularities: There are no moving singularities in ni, since \n\ always 
remains bounded. Also, since the deterministic evolution of the remaining 
independent variables ai and zq depends only on fi or \h\ in such a way that 
daL and dzo remain finite for finite n, then these variables never escape to 
infinity in finite time (being just a time integral of finite values of n). Thus, 
no moving singularities occur. 

2. Noise divergences: Similarly because the noise coefficient of ra^ is constant, 
while the noise coefficients of and Zq depend only on h, and in such a way 
that for finite n the noise magnitude remains finite — then one concludes that 
noise divergences are not present either. 

3. Discontinuous drift or noise: Not present, by inspection. 

4. Initial distribution: Initial coherent state has -P(O) = 6'^{a — q;o)5^(/3 — a*)6'^{Q — 1). 
There are no boundary terms at the outset with such an infinitely-peaked dis- 
tribution. 

How does this change in the Ito formulation? Using ()B.7|) on ()6.42|) . one finds that 
the Ito and Stratonovich equations are identical for and a^, while the Ito weight 
equation is dzo = {n— \n\)'^dt + i{n ~ \n\) dW^. The above arguments apply without 
change to Ito, and to hybrid Ito-Stratonovich algorithms. So — no symptoms of 
boundary term errors remain in any formulation. 

Let us see how the gauge P simulation compares with the positive P in practice 
— i.e. are boundary term errors really removed along with the moving singularities? 
Figure compares results for an (exact) truncated number-state basis calculation, 
a positive P calculation, and a gauge calculation using the circular gauge ()6.40|) for 
an initial coherent state of ao = 1/ V^- Figure 16.31 compares steady-state values for 
exact, positive P, and gauge calculations for a wide range of initial coherent states. 
It is seen that the gauge calculation is correct to within the small errors due to 
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t (scaled time) 



Figure 6.2: Comparison of two-boson absorption simulations: time-varying mode oc- 
cupation. CIRCLES: positive P simulation; solid line: gauge simulation using circular gauge 
H6.40|l : DASHED LINE: exact calculation (truncated number-state basis). Simulation parameters: 
iS = 4 X 10* trajectories; initial coherent state. 

finite sample size, whereas the positive P calculation leads to severe errors at low 
occupations. 

6.4.3 One- and two-boson absorber 

If one now considers single-boson decay as well, but still with no driving, one expects 
that all states will decay to the vacuum on a time scale I/7 on top of the two-particle 
losses, which produce decay of particle numbers on a timescale of 1 (although not 
necessarily right down to the vacuum.) When 7^1, the single-particle decay 
dominates and nothing interesting is seen beyond an exponential decay of particle 
number. However, if 7 < 1, one should first see a rapid decay to a mixture of 
vacuum and one-boson states due to the two-boson process, and then a slow decay 
of the one-boson state to the vacuum on a time scale of t ~ I/7. 

In this case the positive P (Stratonovich) equations display different behavior 
depending on whether 7 is above or below the threshold 7 = 1. Below threshold, 
there is an attractor a.tn = (1 — 7)/2, and a repellor at n = 0, while above threshold, 
the attractor is at = 0, and the repellor atn = — (7 — 1)/2. In either case, there is a 
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{n{t = 0)) 

Figure 6.3: Steady state expectation values of boson number (n) obtained by gauge 
simulations (double triangles) compared to exact analytic results H6.39|l (solid line) and 
positive P simulations (circles) for a wide range of initial coherent states. Size of uncertainty in 
gauge results due to finite sample size is indicated by vertical extent of 'double-triangle' symbol. 
Steady state was observed to have been reached in all simulations by t = 3.5 or earlier (compare 
with Figure and . hence this is the time for which the simulation data is plotted. S = 10^ 
trajectories 
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divergent trajectory along the negative real axis, which is again a moving singularity. 
It turns out that the steady state calculated with the positive P is erroneous while 
7 < 1, and that there are transient boundary term errors while 7 < 2 (TO]. The false 
steady state below threshold lies at the location of the attractor: (1 — 7)/2. 

Let us try to fix this problem using the same circular gauge ()6.40|) as before. The 
gauged equation for is now 

driL = {I - - 2\n\) dt + idW+ , (6.43) 

while the a and Q evolution is unchanged. So, above threshold one is left with only 
an attractor at n = 0, while below threshold one has a repellor at n = surrounded 
by an attracting circle at \n\ = (1 — 7)/2. This phase space again has no moving 
singularities or noise divergences. 

The results of simulations for the parameter 7 = 0.1 are shown in Figure 
(7^1 was chosen so that a system with two widely differing time scales is tested.) 
The gauge simulation tracks the exact results and avoids the false results of the 
positive P simulation. Note also that the gauge simulation remains efficient for a 
wide range of occupation numbers — from (n) ~ 100 ^ 1, where the positive P is 
also accurate, to (n) ~ 0.1 -C 1 where it is totally incorrect. 

6.4.4 Driven two-boson absorber 

The other type of situation to consider is when there is a driving field as well as 
two-boson damping. In this Subsection the coherent driving e is nonzero, but the 
single-particle loss rate is assumed negligible (7 = 0), since this process never causes 
any of the simulation problems anyway, but leaving it out simplifies analysis. Failure 
of the positive P representation method has been found in this limit as well [7T] , and 
is evident in Figure The equation for n is no longer stand-alone in this case, and 
all three complex variables must be simulated as in ()(j.34cj) . the Q equation being 
the same as in the undriven case ()(i.41|l . 

A treatment of the singular trajectory problem with the same circular gauge ()6.40|) 
leads again to correct results, as seen in Figure 16.51 
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Figure 6.4: Comparison of simulations for system with both one- and two-boson 
damping. Relative single-boson damping strength 7 = 0.1; Circles: positive P simulation. 
SOLID LINE: simulation using the circular gauge H6.4U|I : dashed line: exact calculation (truncated 
number-state basis). Gauge simulation parameters: 100 000 trajectories; initial coherent state |10) 
with (n) = 100 bosons. 

6.4.5 Relevance to interacting Bose gas 

The master equation ()6.32|1 was first considered in the context of quantum optics. 
There, single-photon losses are usually large relative to two-photon processes, and 
mode occupations are high, so the boundary term errors have not been a practical 
problem in their original quantum optics regime. However, for other physical systems 
such as interacting Bose gases (which this thesis is most interested in), two-particle 
losses can be stronger, one expects a large number of modes to have low or practically 
zero occupation, and so the boundary term errors may be significant. For example, 
collisional many-body processes that remove particles in pairs from the modeled 
field \E', will lead to similar two-particle loss terms ()2.24j) in the master equation. 

6.5 Removal example 2: Single- mode laser 

Let us now consider the second quantum system for which systematic errors have 
been seen with the positive P representation. The mechanism by which boundary 
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t (scaled time) 

Figure 6.5: Driven two-boson absorber with e = 0.05. Circles: positive P simulation 
(1000 trajectories); solid line: circular gauge (|6.40() simulation (100 000 trajectories); dashed 
LINE: exact calculation (truncated number-state basis). Initial vacuum state. 

term errors occm here is somewhat different than in the two-boson absorber. For 
two-boson damping, boundary term errors occur even when one chooses an optimal 
(i.e., compact) initial distribution to represent the starting state |ao) (c^ol, whereas 
for this laser model the systematic errors occur only if one starts with unusually 
broad initial distributions. Nevertheless, such broader distributions may arise during 
time-evolution from some other state, so consideration of what occurs for significant 
distribution broadness is also relevant to practical simulations. 

This section is closely based on Section IV of the published article by Deuar and 
Drummond . 

6.5.1 The laser model 

For a simple model of a photonic or atomic laser, using the single-mode positive P 
representation, 

A{C = {a, (3} ) = \\a) je""'^, (6.44) 
one can derive [7T| IM] the the Fokker-Planck equation 



Section 6.5 Removal example 2: Single-mode laser 



147 



where the average number of photons in the mode is n = (a/5)sto(,h' A/" is a 
scahng parameter that equals the number of atoms in the gain medium. Both G, 
the gain parameter, and Q > G/Af, the noise parameter, are real and positive. One 
usually considers the scaled variables 



a = a/VAf (6.46a) 
p = p/VX, (6.46b) 



The stochastic Ito equations are then ffTj IM] 



da = {G - a(3) adt+ y/2Q dr] (6.47a) 
d/3 = {G -ap)/3dt+ ^/2Qd7]*, (6.47b) 



where dr] is a complex Wiener increment obeying {drj* drf) ^^^^^ = dt, {dr])^^^^^ 
One is again usually interested in the (scaled) occupation of the lasing mode 



(n,c = ap) = {a^a)/U, (6.48) 

\ / stoch 



n. 

/ stoch 

i.e. the number of bosons in the lasing mode per each atom in the gain medium. 
Changing variables to and an auxiliary "phase-like" variable Z2 = a/P, the FPE 
becomes 

^ = 2-^ [Q + n3e(nsc - G)] + 2Q^^ + 2Q-^n,, - 2Q^^ P. 
dt [ dusc dz2 Z2 dnj^ oz2n^c) 

(6.49) 

The diffusion matrix is now decoupled 



4Q 



nsc 
-zlln^. 



(6.50) 



and so the two complex variables can evolve under independent noises. In fact the 
entire evolution decouples from Z2, and can be written as the closed Ito equation 



dn^c = -2(nsc - a)(nsc - b)dt + , (6.51) 
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where now the real Wiener increments (implemented as Gaussian noises) have vari- 
ance ('iW^^)stoch ~ deterministic stationary points are 

a= 1(g+ ^G^ + 4q) > (6.52a) 
b= 1(G- v/G2 + 4g) < 0. (6.52b) 



(In the Stratonovich calculus, [^] = |(G± a/G^ + 2Q) ). One finds that the station- 
ary point at a is an attractor, and at b there is a repellor. Defining 
as the distance (in the negative direction) from the repellor, one obtains 

= 2n,eg(nneg + + AQ) + noise , (6.53) 

which shows that there is a singular trajectory escaping to infinity in finite time 
along the negative real axis Usc < b — i.e. a moving singularity. 

6.5.2 Initial conditions 

Let us consider the case of vacuum initial conditions. A vacuum can be represented 
by the positive P representation 

P+{a,l3) = 6\a) 6\j3) , (6.54) 

but also by Gaussian distributions of any variance ctq, around the above. 

It can be checked that this satisfies the vacuum positive P representation condition 
()3.34j) . giving another family of equivalent vacuum distributions different to ()3.35|) . 
which was found in Section [3. 3. 61 Note: the distribution of Usc is non-Gaussian, but 
has a standard deviation of an^c ~ V^ctq both the real and imaginary directions. 

It has been found by Schack and Schenzle [llj that for this single-mode laser 
model, a positive P simulation of pumping from a vacuum will give correct answers 
if the usual (5-function initial condition ()6.54|) is used, but will have systematic errors 
if an initial condition ()6.55|1 has a sufficiently large variance (see Figure IFTHjl . 

This can be understood by inspection of the stochastic equation ()6.51|) . If 
one has a sufficiently broad initial distribution, that some trajectories start with 
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Re {usc} < b, then the singular trajectory will be explored by the distribution. 
From this it appears that for all cxq > 0, some boundary term errors may be present. 
In practice, no errors were seen by Schack and Schenzle for small enough (Tq, pre- 
sumably because their magnitude was below the signal-to-noise resolution obtained 
in those simulations — an interesting effect in itself. Additionally, even if initially 
no trajectories fall on the singular trajectory (e.g. some initial distribution that is 
exactly zero for Re {rise} < b), the region Re {usc} < b may be subsequently explored 
due to the presence of the noise terms. 

Apart from the obvious 5-function initial condition, one might want to try the 
canonical distribution ()H.1()|1 applicable generally to any density matrix. For optical 
laser models, this will not cause practical problems as then the variance is ctq = l/A/", 
which for any realistic case will be very small (i.e., o"„^^ ^ Hence, boundary term 
errors would be negligible for a wide range of simulation times, based on the evidence 
that they were not seen [3] in this regime. Nevertheless, broader distributions may 
arise during time-evolution of some other state to a vacuum, so consideration of 
what occurs for significant ctq is also relevant to practical simulations. 

Incidentally, the anomalous results were discovered in [21] when ctq = 1 was 
chosen by the erroneous procedure of scaling the equations while not simultaneously 
scaling the canonical initial condition in a. 



6.5.3 Drift gauged equations 

Let us introduce the single-mode gauge P kernel ()5.4|) (compare with ()6.44|) . which 
differs only by lack of the global weight factor e^°). Proceeding to introduce drift 
gauges as in Section lOl (for rise but not for Z2, as averages of this variable will not 
figure in what follows), one obtains the Ito stochastic equations 



dn^c = 2n,c{G -n,^)dt + 2Qdt + 2^/Qn^{dW -gdt) (6.56a) 
dzo = -^g^dt + gdW, (6.56b) 



150 



Chapter 6 Boundary term errors and their removal 



as per the standard gauge formulation ()4.90|1 . It is convenient to define a transformed 
gauge function 

Gin)=Q\l^ (6.57) 

V ^sc 

, which is also arbitrary and enters into the simulation as 



drisc = 2nsc{G — — Q{n)) dt + 2Q dt + 2^/Qn^dW (6.58a) 



2 



dzo = + (6-^^^) 

6.5.4 Correcting for the moving singularity 

Consider the (Ito) deterministic evolution of, n'^^, the real part of = n'^c + ^^so 

= -2n'J + 2Gn',^ + 2Q + 2nl^ - 2Re {n,,Q(rC)} • (6.59) 

The moving singularity is due to the —2ri!^^ leading term for negative values of v!^^. 
The drift gauge is now chosen according to the criteria below (as in Section I6.3.2| 
but in a different order): 

1. As pointed out in Sections I4.H.4I and I6.H.2| it is desirable to keep the gauge 
terms to a minimum because whenever they act the weights of trajectories 
become more randomized. Thus, let us restrict ourselves to functions Q{n) 
that are only nonzero for v!^^ < 0. 

2. There should not be any discontinuities in the drift equations, as this might 
lead to bad sampling of some parts of the distribution and thus possibly to 
systematic biases. A further problem with discontinuous gauges is that the 
Stratonovich correction term for dzo cannot be calculated at the discontinu- 
ity, preventing use of the more stable semi-implicit numerical algorithm (See 
Appendix IB|1. 

Hence, in particular, in light of the previous point 1., one should have lim„/ _^o( G{n) 
0. For ease of analysis, let us start with a simple form for the gauge, = 
c — \n'^^ + Xyfi'^c- Continuity immediately implies c = \y = 0, hence 

, -An' if n' < 
Qin) ={ , (6.60) 

if nL > 
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3. The next necessary (and most important) condition, to remove moving singu- 
larities, is that the —2n'^^'^ term is canceled, hence: 

A>1. (6.61) 

Using ()B.7j) . the Stratonovich correction for dngc is —Qdt, and so the family 
of hybrid Ito- Stratonovich equations differ only in having this term a multiple 
of Q dt in the range [0, 2Q dt]. This has no effect on the removal or not of the 
moving singularity. 

4. Now, if A = 1 there are no systematic errors, but the sampling error very 
quickly obscures any observable estimates because n'^^ still heads to — oo ex- 
ponentially fast due to the 2Gn'g^ term. This takes it into regions of ever 
increasing |^(ra)|, and weights quickly become randomized. For slightly larger 
parameters A, the n'^^ evolution takes trajectories to a point lying far into the 
negative n'^^ region where the two leading terms balance. Here the trajecto- 
ries sit, and rapidly accumulate weight noise. It is clear that for an optimum 
simulation all stationary points of n'^^ in the nonzero gauge region must be 
removed. In this system the condition for this is 

A > 1 + |, (6.62) 

for the Stratonovich calculus, and A > 1 + G^/4Q for Ito. 

5. Are any new moving singularities or noise divergences introduced? The gauged 
equation for the imaginary part of is 

= 2<JG + <(A - 2)] dt + 2 I I sin il^n,,) dW, (6.63) 

which is attractive towards n"^ = when A > 2 in the gauged {n'^^ < 0) region, 
and exponentially growing otherwise. In either case, no super-exponential es- 
cape occurs, and no moving singularities. The evolution of zq depends only on 
finite expressions of finite nsc, is thus an integration over the history of a tra- 
jectory, and so does not diverge in finite time. The Stratonovich correction for 
dzQ is A (rig^ -|- nsc + |?t,sc|) /2 in the gauged region, zero otherwise. Introduction 
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12 3 



t (scaled time) 

Figure 6.6: One-mode laser G = 1, Q = 0.25. Dashed line: (correct) positive P simulation 
with (5-fmiction initial conditions H6.54|l CTq = 0, and 100000 trajectories. Dotted-dashed line: 
erroneous positive P simulation with Gaussian initial conditions (|6.55|) CTq — 0.1 initially, and 
100000 trajectories. Dotted line: erroneous positive P simulation with CTq = 1, and 10000 
trajectories. Solid line: gauge calculation for ctq =0.1 with A = 4, which corrects the systematic 
error of the positive P. Only 4000 trajectories, so as not to obscure other data. 

of such a term or its multiple does not lead to any moving singularities for 
Zq evolution either. One concludes that no new moving singularities or noise 
divergences are introduced by the gauge ()6.60|1 . 

The results for an example have been plotted in Figure 16.61 The parameters 
there were G = 1, Q = 0.25 (leading to a ^ 1.1124 and b ^ —0.1124 in Stratonovich 
formulation ). The initial condition in the example was cXq = 0.1, which is already 
much larger than the canonical variance for physically likely parameters in a photonic 
laser. Typical values of \nsc\ initially will be of order o"„^^ ~ 0.14 > |6| here. A good 
choice of gauge was found to be A = 4. The use of this gauge restores the correct 
results. 

6.5.5 Non-optimal initial conditions 

As one increases the spread of the initial distribution beyond an^^ ~ \b\, it becomes 
increasingly difficult to find a gauge that will give reasonable simulations. For ex- 
ample a wide variety of what seemed like promising gauges for ctq = 0.3 have been 
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tried (with the same values of parameters Q and G as in Figure I6.6|l , and none have 
come close to success. The problem is that while systematic errors are removed, 
large random noise appears and obscures whatever one is trying to calculate. 

Trajectories that start off at a value of n'^^ lying significantly beyond b require a 
lot of modification to their subsequent evolution to (1) stop them from escaping to 
— oo and (2) move them out of the gauged region of phase space so that they do not 
accumulate excessive weight noise. If there are many of these, the trade-off between 
the gauge size and length of time spent in the gauged region does not give much 
benefit anymore. Some consolation is gained in knowing that results will at worst 
be noisy and unusable, rather than being systematically incorrect. 

6.6 Summary of main boundary term results 

The two mechanisms that lead to boundary term errors have been specified and 
investigated for general phase-space distribution methods: 1) Errors of the first kind, 
which are caused by discarding nonzero boundary terms in the partial integration 
step between the master and Fokker-Planck equations, and 2) Errors of the second 
kind, which can arise if the estimator of a particular observable for a given kernel 
grows too rapidly as the boundaries of phase-space C are approached, and may be 
present or absent depending on the moment chosen. 

The exact expression for the first kind of boundary terms has been found: (jfj.lll . 
and the integrals that must converge for moment calculations to avoid the second 
kind are given by ()6.8|) . While these cannot in general be explicitly evaluated for 
non-toy models because the exact form of the distribution P{C) at the boundaries 
of phase space is required, a simple example has been solved for instructive purposes 
in Section [6.2.21 and these boundary term expressions explicitly evaluated. For the 
gauge P representation, conditions on P{C) ()6.7|) and ()6.24j) are found, which are 
sufficient to avoid boundary term errors of the first and second kind (respectively). 
In particular, for the gauge P representation: 

1. The simulation will be free of boundary terms of the first kind when: A) Kernel 
drift gauges are polynomial in the system variables or their conjugates, and B) 
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the far tails of P{C) decay faster than exponential in the coherent amplitudes 
and faster than power law in Q. 

2. Estimators of observables polynomial in the and operators are found to 
be free of boundary term errors of the second kind provided that the far tails 
of -P(C) decay faster than a power law. 

This is consistent with the results found by Gilchrist et a/ [HI]. No indication that 
such biases will occur for interacting Bose gas simulations is found in Section [6.2.41 

In Section I6.1.2[ four symptoms of boundary term errors of the first kind are 
pointed out. These are moving singularities, noise divergences, discontinuities, and 
excessively broad initial conditions. All four can be checked for by inspection and 
some analysis of the stochastic equations and initial condition, before any simulation. 
Explicit conditions (jfi.Hj) have been found for simulations on open complex phase- 
spaces, which are sufficient to ensure that neither moving singularities nor noise 
divergences occur. It is also noted that by the Painleve conjecture, boundary term 
errors of the first kind are likely to be generically present in ungauged many-mode 
simulations based on analytic kernels. 

All the above results have been collected here with the aim of systematizing 
and reducing the confusion on the topic of boundary term errors. Subsequently, 
Section lfi.H.2l introduces a broadly-applicable heuristic method by which the symp- 
toms of boundary term errors of the first kind can be removed using drift kernel 
gauges (and hence, presumably, also the boundary term errors as such). This re- 
moval method is applied to the two cases in the literature where systematic bias has 
been reported when using the positive P representation: two-boson absorption, and 
a single-mode laser model. It is found that the systematic bias is indeed removed 
from simulation results by this method for these examples. 
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Chapter 7 

Gauges for single-mode interacting 
Bose gas dynamics 

7.1 Motivation 

Simulations of the dynamics of multi-mode (locally) interacting Bose gases (with 
lattice Hamiltonian ()2.17p ) using the positive P representation suffer two major 
technical problems caused by instabilities in the stochastic equations: 1) Exponen- 
tial growth of distribution broadness leading to rapid loss of any useful accuracy. 
2) Probable moving singularities when two or more modes are coupled (See Sec- 
tion [7]TS1)- These issues prevent simulation of all but short time behaviour. Gases 
with non-local interactions (Hamiltonian ()2.12j) ) are expected to suffer from similar 
problems on the basis that the locally-interacting lattice model is a special case of 
these more general models. 

The instabilities arise from the two-body interaction terms in the Hamiltonian, 
and so for a locally-interacting model the unstable processes decouple and are local 
to each spatial lattice mode of the model. Hence, if the instabilities are brought 
under control for each mode on its own, then simulations of the full many-mode 
model should benefit as well. 

In this chapter, a single-boson mode with two-body interactions is considered, 
and gauges (kernel drift and diffusion) are introduced to make improvements. A 
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drift gauge can remove the offending instability, while the diffusion gauge makes a 
tradeoff between noise in the phase-space (i.e. a and (3) equations and noise in the 
global weight zq, and will be optimized to improve simulation times. In Chapter |H1 
the performance of the method developed here will be investigated in a coupled 
two-mode system as a prelude to multi-mode simulations. 

7.2 The single- mode interacting Bose gas 
7.2.1 Physical model 

Consider a single mode extracted from the open multi-mode locally-interacting lat- 
tice model of Section 12.51 Annihilation and creation operators for the mode are a 
and a\ and obey [a, a"!"] = 1. From ()2.17|1 . and with possible coherent gain added as 
in ()2.25|) . the Hamiltonian is 

H = hn {uj + x[n - I]) + ih{ea) - £*a), (7.1) 

The master equation of this system interacting with an environment is given by the 
usual Linblad form ()2.20|) . with Linblad operators 

Li = 0^/7(1 +^bath) (7.2a) 
L2 = aV7^bath (7.2b) 

modeling single-particle interactions with a standard boson heat bath with a mean 
number of particles nbath(7') per bath mode, as in Section 1^31 

Physically this model approximates a single mode of interest n of a multi-mode 
system with two-body scattering, where evolution of the other modes m 7^ n is 
assumed negligible. Roughly, the linear self-energy term becomes u = ujnn, highly 
occupied coherent modes m can be collected into e = —iJ2m^nm{^m), while the 
remainder of modes can become the heat bath. The single-mode model can also 
sometimes represent an approximation to a single orbital of a more complex system, 
if the constants u,e, nbath and x are chosen appropriately. 
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7.2.2 Stochastic gauge P equations 



A single-mode gauge P kernel ()6.33|) is used, where the complex configuration vari- 
ables are coherent state amplitudes a and (3, and a global weight Q = e^°. From 
the multi-mode equations ()5.17|1 . ()5.18|) . and ()5.2H1 of Section ESI the gauge P Ito 
equations for this model are 



da = —i(jjadt — 2ixQ^(3dt——adt + edt + '^^B_ik{dWk — Gkdt) (7.3a) 

k 

dp = iujl3dt + 2ixl3'^adt-^(3dt + e* dt + ^B2k{dWk-Qkdt) (7.3b) 



dn = nJ2^kdWk, 



(7.3c) 



where the pre-drift-gauge noise matrices B_jk defined as in Section 14.3.11 and drift 
gauges Qk have not been specified yet. The dWk are independent Wiener increments, 
which can be implemented by Gaussian noises of variance dt. The elements of the 
2 X Nw complex noise matrices 5 satisfy 



Dn = Ek Blk = -2^X«' 
D22 = Ek Blk = 2^X/3' 

Dl2= EkB.lkB_2k =7^bath(T). 



(7.4a) 
(7.4b) 
(7.4c) 



The Nw complex drift gauge functions Qk are arbitrary in principle. 

The standard gauge formulation ()4.90p will be used, with the proviso that noises 
related to the bath interaction and the interparticle interactions are chosen to be 
independent to simplify the equations as outlined in Section 14.4.71 That is, using 
diffusion matrix D_ = B_B^ = Ei ^^''^ with 

2 



D^^) = 2ix 



-a- 
f3^ 

one obtains the square root noise matrix forms 

ia 
(3 



^^'^ =7^bath(T) 



B 



(1) 



B 



(2) _ /7^bath(r) 





1 

1 
1 



1 



i 



(7.5) 



(7.6) 



In Section 17.2.41 it will be shown that the process responsible for instabilities 
is the two-particle collisions (parameterized by x)? and so improvements should be 
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searched for by gauging this process, not the bath interactions (parameterized by 
7). Thus, the standard gauges are apphed as: 

1. Imaginary diffusion gauges g"/^ as in Section 14. 51 Recall from Sections I4.4.HI 
and l4.4.4l that only imaginary standard diffusion gauges can alter the stochastic 
behaviour of the simulation, so only these will be considered. One obtains 



B 



BS^O{g'l,) Bf 



(7.7) 



with an orthogonal matrix O of the form ()4.68p dependent on a single imagi- 
nary diffusion gauge gi2 = ig" with real g" . 

2. Complex drift gauges Qi and with the remaining Qk>2 = 0. 

The two- variable diagonal diffusion gauge expression ()4.71|) applies here, and the Ito 
stochastic equations become 

da = —iua dt — 2ixa'^P dt — dt + edt + a/ 7^bath'^'7bath 

+ia^/2i^[{dWi - Gi dt) cosh g" + i{dW2 - G2 dt) sinh g"] (7.8a) 

7 



d(3 = iu(3 dt + 2ix0^a dt - -/3dt + e* dt + ^/Trvb^d■r]l^^^ 

+f3^J%c [-i{dWi - Gi dt) sinh g" + {dW2 - G2 dt) cosh g"] (7.8b) 
dn = n {GidWi + G2dW2} (7.8c) 

in terms of the real Wiener increments dWi, dW2 implemented by Gaussian noises 
of variance dt, and the complex stochastic Wiener-like increment drf^s^th obeying 

(c?r/bathC^r/bath)stoch = and (c/??bath)stoch = 0' 



7.2.3 Anharmonic oscillator 

Consider the gain-less system (rrbath(7') = e = 0), which contains all the essential 
features of the two-body interactions. This system is known in the literature as the 
damped anharmonic oscillator. 

For this toy system, the observables can be solved analytically, and comparing 
simulations with these will be of great use here to optimize diffusion gauge choices. 
In particular, consider coherent state projector initial conditions |ao)(/5o|, which 
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correspond to the configuration of a single trajectory out of the S system samples 
that together can represent any arbitrary quantum state. Let us concentrate on the 
mean particle number 

{n) = (a^a), (7.9) 
and the first-order time correlation function^ 

G(i)(0, t) = f3o{a) = a*o{a^y, (7.10) 

which contains phase coherence information. Normalizing by (q;o|/?o) — Tr [ |ao) (Pol ]■ 
their expectation values are found to be 

(n) = noe-^\ (7.11a) 
GW(0,t) = noe-^'/'e-'-'exp{ >^ (e'^^xt-T* _ i) 1 (y.ub) 

where uq = aoPo. 

When the damping is negligible, no real, and the number of particles is no ^ 1, 
one sees that the initial phase oscillation period (ignoring u) is 

W = — sin-M — , 7.12 

and the phase coherence time, over which |G'*-^-'(0, t)| decays is 

W = ^ cos-' (l-^)^ -i— . (7.13) 
2x V 2noy 2x^/n^ 

The first quantum revival occurs at 



TT 



^revival • ('''•-^4) 

X 

^This first order correlation function is usually written in the Heisenberg picture as G'^^^O, t) = 
(a^ {0)a{t)) . Moving to the Schrodinger picture in which the gauge P representation is de- 



fined, one has G^^^ = Tr 



JHt/h-pf -iHt/h 



p(0)at 



Because of the coherent state conditions 



used here, p(0) ~ A(q;o,/3o,^^ — !)■ Thus, p{0)a.^ = (3op{0) using Ij5.7d|) . This gives 
G*^^-* = /3oTr ae^''^*/'^p(0) e'-^*/'* , and identifying the Schrodinger picture density matrix as 

p{t) = e~*^*/'"'p(0) e*^*/'^, this gives the desired expression G^^'>{0,t) = /3o(a)- To obtain the 
right-hand side expression in (|7.1()(l . note that G*^^-* is a complex quantity composed of two ob- 
servables (see (I7.2()|l 'l — one for the real, and one for the imaginary part. Taking the adjoint, 
G''^\0,t)* = (at(<)a(0)) in the Heisenberg picture, and following the same procedure as above, one 
obtains G^^\OA)* = ao(a^). 
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The stochastic equations for this gain-less system are more convenient in terms 
of the logarithmic number-phase variables 

riL = log(a/?) (7.15a) 
rriL = log ( "5 ) ■ (7.15b) 



For coherent states where f3 = a* , ni is the logarithm of the mean number of parti- 
cles while rriL = 2iZa, and characterizes the phase of the coherent amplitude. 
The Ito equations in the new variables are 

duL = —7 dt + li^fixe^^ [dr] — Qt^^) dt) (7.16a) 
drriL = -2i(cu-x + 2xe"^)c/t + 2iv^e^"(rfr/*-^(^)rft) (7.16b) 
dzo = g(rn)d^ + Q(n)dr]* - Q{.a)Q{m)dt = d{\ogVt), (7.16c) 

where the complex Wiener-like increment dr] = (dWi — idW2)/V2 has variances 

{dvdvlstoci. = dt (7.17a) 



and the independent transformed complex drift gauge functions are 
^ Gi- iQ2 ^ Qi + iQ2 



(7.18a) 



Note: these are not necessarily complex conjugates since the original Qi and Q2 are 
in general complex. 

7.2.4 Behaviour of the positive P simulation 

Lack of moving singularities 

Consider the anharmonic oscillator equations ()7.16j) in the positive P representation, 
where ^fc = 0, and g" = 0. By comparison with the condition ()6.3p . one can check 
by inspection that moving singularities do not occur : dn^ does not depend on 
any variables and so is stable, while drriL depends only on the other variable rii. 
Thus the deterministic behaviour of is just integration of a finite function of 11^ 
(which itself remains finite). The noise in is of constant magnitude. Zq does 
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not evolve. All variables remain finite, no moving singularities or noise divergences 
occur. Therefore, none of the symptoms of boundary term errors from Section r6.1.2l 
are present in the equations. 

Indeed, the investigations of Gilchrist et al jHl] found that boundary term errors 
do not occur for the special case of the single-mode anharmonic oscillator. 

Rapid growth of statistical error 

If the observable estimators converge to the quantum mechanical expectation values 
in the limit of many trajectories so that accuracy of the simulation is not a problem, 
there still remains the issue of how large a sample of trajectories has to be to give 
a precise result. 

While the evolution of rti is well-behaved, being simply constant isotropic diffu- 
sion and possibly some decay, the deterministic evolution of mi can be very sensitive 
to ul due to the exponential drift term. 

Particularly Re{mi} has a great influence on observable estimates of phase- 
dependent observables, such as G^^\0,t). Consider the quadrature observables 

. (7.19) 

which are closely related to the flrst-order correlation function 

^(0, t) = Poim - ^qi-^m) = (Mm + iqi-nm))* . (7.20) 

Comparing to ()5.10|) and ()5.13|) . the estimator for (g) is 

(Re|§(«e*^ + /5e-*^)|), , 



(Re {exp (^0 + ^) cosh {id + ^)]) 



stoch 



We can see that q is very sensitive to Re {m^,} due to the cosh factor. This real part 
of mi evolves as 

dRe {mi} = 4xe^"^"^^ sin(Im {ul}) dt + . . . , (7.23) 

which can be a very rapid growth even for moderate values of Re{nj^}. Worse, 
even a moderate spread in Re {ul} leads rapidly to a very wide spread of Re {mi}, 
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not to mention the spread in the factor cosh(m2,/2) that appears in the observable 
estimate. The behaviour of this spread will be considered in detail in Section 17.5. 3| 
but for now it suffices to point out that there arises a characteristic time scale tgim 
beyond which the uncertainty in phase-dependent observable estimates grows faster 
than exponentially with time, and any simulations become effectively useless no 
matter how many trajectories are calculated. 

Numerical investigations (See Figure FOl and Table UT^ found that a positive P 
simulation lasts at most for times 



For large mode occupation uq ^ 1, this is not enough time for significant phase deco- 
herence to occur (compare to tcoh in fl7.13j) ). This unfavorable scaling is known to be 
a major unresolved stumbling block for positive P simulations of the interacting Bose 
gas model ()2.1|1 in many physical regimes. For example, evaporative cooling simu- 
lations encounter such sampling problems upon the onset of Bose condensation 
The expression (I7.24|) indicates that the simulation time for a many-mode system is 
likely to be limited by the simulation time of the most highly occupied mode (this 
is confirmed by the simulations in Chapter EH), which makes it especially worthwile 
to improve simulation times at large mode occupations. 

Greater stability when damped 

Consider the evolution of the log-occupation rii in ()7.16j) . In a positive P simulation 
(with g" = 0), the noise term produces a spread in both the real and imaginary parts 
of riL of standard deviation a = \J1yt. To a good approximation, all trajectories 
in a reasonable-sized sample will lie within about 4cr of the mean value (which is 
log(no) — 7t). If the values of Re{nL} for all trajectories are <C —log 2%, then 
the nonlinear term in the (im^ evolution equation becomes negligible, and stability 
problems abate. This will occur if the damping 7 is large enough in comparison 
with the two-particle collisions x- In Section I7.5.4| this behaviour will be found to 
depend on the "damping strength" parameter ()7.70|) . which will quantify what is 
meant by "7 big enough" . The increased stability at higher damping is a well-known 



(1.27 ±0.08) 



(7.24) 



sim 
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feature of positive P simulations of the anharmonic oscillator, and/or an interacting 
Bose gasjinj. 

7.2.5 Coupling to other modes and moving singularities 

Non phase-dependent observables such as n = a^a can still be calculated at long 
times for the anharmonic oscillator with the positive P method, because the evo- 
lution of their estimators (e.g. i^"''^) stoch) ^'^^^ depend on the unstable m^. 
This is no consolation, however, because we are ultimately interested in simulating 
many-mode systems. (Singe-mode toy problems will fall even to a brute force trun- 
cated number-state basis calculation, so all these involved stochastic schemes are 
only ultimately justified for many-mode models). When many modes are present, 
the convenient separation of nz, and rriL evolution seen in ()7.16|1 is no longer present 
due to mode- mixing terms (from kinetics and external potentials). This occurs even 
for a single-mode model with nonzero pumping e, or a finite-temperature heat bath 
^bath > 0, which also model underlying mode-coupling processes. If the ul and m^, 
equations are coupled, the growth of the spread of Re {mi} will also feed the growth 
of the spread of n^,, making a// moment calculations intractable after some relatively 
short time. 

Furthermore, moving singularities may appear. Reverting back to considering 
the a and P equations ()7.8p . the Ito positive P evolution of a will be of the form 

da = -2ixa^l3 dt + edt + . . . , (7.25) 

and in particular 

d\a\ = 2x\a\'^\P\ sin(Za + Zp) dt + . . . . (7.26) 

When sin(Za + Z(3) is positive, this violates the no-moving-singularities condi- 
tions ()6.3|) . I a I grows faster then exponentially, and moving singularities may be 
possible, leading to boundary term errors. This did not occur for the closed anhar- 
monic oscillator only because of its special symmetry properties that allowed the 
convenient decoupled logarithmic form of the equations ()7.16p . 
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Thus it can be seen that when mode mixing occurs, there are two potential 
problems that can arise after short times: 

1. Rapid appearance of massive statistical errors, masking observable estimates 
in noise. 

2. Possible systematic biases caused by moving singularities in the equations. 

Ways to deal with these issues will be investigated in this chapter for a single-mode 
system, and for coupled-mode systems in Chapter |H1 

7.3 Drift gauges: Removal of instability 

As was seen in fl7.26|l . the instabilities in the equations arise from the nonlinear two- 
body drift terms, and in particular from the part that leads to super exponential 
growth of |a| or \f3\. Defining the real and imaginary parts of the number variable 

n = a/? = e"^ = n + in" (7.27) 

for convenience, the offending terms are: 

da = 2xan"dt + ... (7.28a) 

dp = -2xPn"dt + ... (7.28b) 

whereas the terms 

da = —2ixan'dt + ... (7.29a) 

df3 = 2ixPn'dt + ... (7.29b) 

affect only the phase of a or (3, and are harmless. Furthermore we can see that any 
nonzero value of n" = \a can lead to moving singularities (in a evolution 

if n" > 0, or in (3 evolution if n" < 0). Note that nonzero n" are characteristic of 
"non-classical" states that cannot be represented by non-singular Glauber P distri- 
butions mixture of coherent states. This is because for all coherent state 
trajectories (3 = a*, and so h = n' only. 

Let us follow the heuristic procedure of Section [6. 3. 21 to remove the instabilities. 
Considering the points there, in order: 
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1. Remove instability: Drift gauges Qk are chosen so that the terms causing 
the instabihty ()7.28j) are canceled by the drift correction. Assuming no other 
changes to the drift takes place, then from ()7.8|) . after some algebra, one 
obtains the required gauges: 

= -^/2t^n"e~3" (7.30a) 
^2 = i^/2i^n"e-^" = -iGi (7.30b) 

These effectively cause the replacement in the stochastic equations 

— 2ixandt —2ixan' dt (7.31a) 
2ixf3ndt 2ixf3n' dt. (7.31b) 

In terms of the logarithmic variables, the scaled gauges ()7.18|) are 

Gin) = (7.32a) 
Qm = -2y^n"e-'" = giV2. (7.32b) 

For the anharmonic oscillator, dn^ is unchanged from the positive P simula- 
tion, and the full equations are 

duL = —■ydt + 2i^/ixe~^ dr] (7.33a) 
dniL = -2i(cj-x + 2xRe{e"^})dt + 2iv^e^"dr7* (7.33b) 
dzo = -2v/ixlm{e"^}e-f"c/r/. (7.33c) 

2. Check again for moving singularities: (Ignoring, for the time being, 
any dependence on g" since the diffusion gauge has not been chosen yet) 

• Comparing to conditions ()6.3|) . it is seen that the linear drift terms in 
parameters e, u, 7, and nbath in f)7.8p will not lead to moving singularities. 

• The new nonlinear drift terms ()7.31|1 have lost their radial component 
due to the drift gauges, and so do not lead to any moving singularities 
either. 
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• From ()B.7|1 . the Stratonovich corrections for da and df] are ixoidt and 
—ixPdt, respectively. These extra terms satisfy ()6.Hj) . and so do not 
contribute any moving singularities in any of the Ito- Stratonovich hybrid 
family of algorithms. 

• That leaves the new dz^ evolution. The weight deterministically tracks 
the riL evolution, and from ()7.33|1 

dzo = ilm{e''^}{dnL + jdt), (7.34) 

and so if = \og{af3) remains finite for all trajectories (as is seen from 
the lack of moving singularities in a and f3 evolution), then so will 2:0- 

Thus it is seen that no new moving singularities are introduced provided g" is 
well-enough behaved. The dependence on g" is considered in Section 17.5.71 

3. The noise terms in da and d(3 are exponential in a, and so also satisfy ()6.3p 
together with the noise terms of dzQ. thus noise divergences are absent, given 
a well behaved g". 

4. As desired, there are no discontinuities in equations, by inspection (provided 
g" is well behaved). 

5. Gauge efficiency. Comparing to corresponding sub-points from Section R).3.2l 

(a) Corrections to the drift are necessary in all of phase space apart from the 
subspaces n" = of measure zero, so generally Q^. ^ apart from this 
special region. 

(b) For general purpose calculations, one wishes to minimize the quantity 
Ylk l^fcP; given that the instabilities are removed. This is achieved by 
the gauges ()7.3()j) as they are only just large enough to remove the insta- 
bilities, and do not introduce any other modifications to the da, or d(3 
equations. 

(c) Attractors? For the single-mode anharmonic oscillator when e = 0, there 
are deterministic attractors in the phase space at vacuum (n = 0), and 
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n = (x — co')/2x (Stratonovich) or n = —uj/2x (Ito). At all these, n" = 0, 
so the gauge is also zero. This is the desired situation. 

In the more complicated case of nonzero e, there will be some stationary 
points elsewhere in phase space. One could try to construct some gauges 
that would behave as ()7.H()j) in the far tails of phase space i.e. as \a\, \P\ — >• 
oo, but would be zero at these stationary points. In the broad picture, 
however, there seems little point to do this for the single-mode case, 
because in a many-mode simulation all modes will be coupled together 
by the kinetic interaction uj^^^, and e will be different or absent. 

(d) Rather than tailoring the simulation for a single observable O, the quan- 
tity l^feP was minimized here to keep the gauge applicable for a gen- 
eral case. 

6. None of the features to be avoided occur. 

(a) The gauge is nonzero over a wide range of phase-space, including regions 
often visited. 

(b) The gauge does not change in a particularly rapid fashion in phase-space 

(c) The gauge is autonomous. 

(d) The drift gauges break the analyticity of the equations, as suggested by 
the conjecture of Section IB. 1.41 

7.4 Exponentials of Gaussian random variables 

From the single-mode equations for the locally-interacting Bose gas ()7.8p (or indeed 
from the many- mode equations ()5.17|1 or it is seen that noise terms due to 

the interparticle interactions are generally of the multiplicative form 

da (X a^dWk + . . . ; d(3 oc l3y/xdWk + . . . . (7.35) 

This leads to Brownian-like motion in the logarithmic variables or rriL, as e.g. in 
()7.16p . On the other hand, the observable estimates (e.g. (n) or G^^\0, t)) typically 
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involve quantities such as a or a(3 — in the original non-logarithmic variables. 
Clearly there will be a lot of averaging over random variables that are of similar 
form to the exponential of a Gaussian. Let us investigate the behaviour of such 
random variables. 

Let ^ be a Gaussian random variable of mean zero and variance unity, thus its 
distribution is 

Pr(0 = exp (-^^ . (7.36) 



The moments of ^ can be found by integration of ()7.36|) to be 



if > is even 



(«'>„„e.= p.37) 
I if > is odd 

Now let us define the exponential random variable 

= voe"^ = e""^ (7.38) 
with positive real a and Vq. Using ()7.37|) . the exponential variable's mean is 

and so, also 



var [v^] = al = vle"'^ [e"^ — I 



, 2 



1 



(7.40) 



If one is interested in estimating quantities such as v^j using S samples, then by 
the Central Limit Theorem, the relative uncertainty in such a finite-sample estimate 
will be 

A^;. = (7.41) 

To obtain only a single significant digit of accuracy, the number of trajectories 
needed is then 



lOOfT^ 



lOOVe'^' - 1. (7.42) 
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1 234 1 234 

a a 



Figure 7.1: Number of samples Smm required to obtain a single significant digit of precision 
in the mean of a random variable that is an exponential of a Gaussian random variable with 
standard deviation a. Scales: (a) Linear, (b) logarithmic. 



This minimum number of required samples (independent of Vq) is plotted in Fig- 
ure l7.1l It can be seen there that for a > 3 the number of samples required becomes 
intractable, and this sets a practical limit on the variance of the logarithmic variable 
Vl = (T^: 

a'<0 (10) , (7.43) 

which will be referred to numerous times in this and following chapters. 

At small standard deviations a < 1, on the other hand, ~ a, and the vari- 
ance of the logarithmic and exponential variables (var [f^,] and cr^, respectively) is 
approximately the same — a result that will also be useful. 

Lastly, this sampling problem can also lead to systematic biases (but not bound- 
ary term errors) in averages of v with finite sample number S once o" > 3, when 
()7.43j) is violated. This is considered in detail in Appendix |X1 

Means of some combinations of Gaussian noises 

Some expressions that will be used in Sections l7.5^ and 17.8.21 are given below. Since 
they contain only odd powers of ^, 

(sin(crO)stoch = (7.44) 
(ecos(a0)3toch = 0. (7.45) 
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And, by the same series expansion approach as for ()7.39|1 



(esin(aO) 



(cos((tO) 



stoch 



stoch 



stoch 



cr 



a 




) 



(7.47) 



(7.48) 



(7.46) 



7.5 Optimization of diffusion gauges 
7.5.1 Aims 

Upon choice of the drift gauges ()7.30|) to remove the offending moving singularities, 
diffusion gauges can be chosen to vastly (as it turns out) improve the sampling 
behaviour of the simulation. The aim here will be to arrive at a diffusion gauge g" 
for the one-mode system that satisfies the following: 

1. Improves useful simulation times in the low-damped high boson occupation 
regime (this is the regime where simulations without diffusion gauges give 
very unsatisfactory results - see Figure 17^ and Section !?. 2. 4|) . 

2. Is easily generalized to many-mode situations. This means, in particular, that 
the gauge choice should allow for the possibility of mode occupations changing 
dynamically due to inter-mode coupling (although this is not actually seen in 
the one-mode toy system). 

3. Is expressed as an exact expression, or does not require excessively involved 
calculations to evaluate. This is important, since in a simulation the gauge 
should be evaluated at each time step if it is to adapt to dynamically changing 
mode occupations. 

4. Applies on timescales of the order of the coherence time (and shorter timescales 
as well). This is a relevant timescale only when mode occupations are larger 
than order unity, and is then smaller than the quantum revival time ^revival = 
tt/x. At lower mean occupations (order unity or smaller), we require that the 
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simulation remains stable for a time 0{l/x), which should be sufficient not 
to prematurely destabilize any coupled highly-occupied modes in multi-mode 
systems. 

5. Depends only on variables in the current trajectory — i.e. on parameters of the 
current coherent-state projector component of the full state p. This is 
necessary if we want to be able to parallelize the calculation, which enormously 
improves calculation times. Also, if the evolution of all possible coherent- state 
projectors can be stabilized to restrict distribution size and statistical error, 
then so will the evolution of all possible states since these are always expressed 
as a distribution, and estimated as an appropriate sample, of such projectors. 

6. We are especially interested in the case of low or absent damping 7 ^ 2%, since 
this is the regime where quantum effects are strongest, and also where the sim- 
ulation is most unstable (see e.g. Sections l7.2.4l and l7.5.4|l . The highly-damped 
regime is a lesser priority, since there a mean-field, or other approximate sim- 
ulation would be sufficient for most purposes. 

This may seem like a lot of conditions for one quantity, but it helps to remember 
that g" is in principle an arbitrary function. 

7.5.2 Variables to be optimized 

Before proceeding directly to searching for advantageous values of the diffusion 
gauge, there remain several more issues to address: 

Which moment to optimize? 

While, strictly speaking, this depends on which moment we might be interested in, 
in general we should concentrate on occupation number (n) or phase correlations 
G^^\ If these basic low-order observables are badly calculated then higher-order 
observables will not do any better, because they are more sensitive to the distribution 
broadness. 
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Consider that estimators for these two observables are (by comparing their defi- 
nitions ()7.9j) and ()7.1()j) to observable estimators in the gauge P representation ()5.1()j) 
and ()5.13p ) are 

in) oc Re{{nh)^^^J (7.49) 
G^'\0,t) oc /5o(fia)3toch or oc a* (n*r)stoch (7-50) 

(the normahzation by denominators R-6 has been omitted). One finds 
from ()7.16|) or ()7.8|) that both dn and dVl are proportional to e~^", so varying the 
diffusion gauge g" acts to scale noise in the number observable {n) ~ (^^)stoch 
smoothly from very large at negative g" to very small at positive g". This does 
not suggest any "optimal" values of g". On the other hand, the evolution of phase- 
dependent expectation values such as G^^\0,t) displays high noise at both large 
negative and positive values of g" but via different processes. (Large noise in = 
at negative g", and large noise directly in a or /3 at positive values). This suggests 
that g" parameterizes some tradeoff between phase-space and weight noise, and 
that there is some intermediate value of g" at which the resulting uncertainty in 
phase observables is minimized. Thus optimization should be based on such phase 
variables. 

Logarithmic variables 

Ideally one would like to optimize for the variance of variables like (aQ), since 
these appear directly in the calculation of the observable G*-^-* . Unfortunately, exact 
expressions for such variances are difficult to obtain in a closed form^, and would 
probably be very complicated if obtained. A complicated form makes it difficult 
to arrive at an expression for the optimum g" that would satisfy condition 3. in 
Section 17.5.11 (i.e. simple to evaluate during each simulation step). 

It turns out, however, that exact expressions for the variances of the logarithms 

of phase-dependent observables such as var [log(a;r2)] = var [zq + 12^/2 + mL/2] can 

^The reason for this becomes clear after proceeding to calculate some logarithmic variances 
in H7.65|l . These require the evaluation of quantities such as (fi"(i')"-"(^"))stoch given in H7.64|l . 
To evaluate var[f2a] — var [e*^^], on the other hand, requires calculating averages of the form 
{n" {ti)n" {t2)n" (t^) , ■ ■ •)sto(.h ^'^ orders, which becomes increasingly involved as the number of 
factors grow. 
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be found with (relative) convenience and ease, and we will look for values of g" to 
minimize this logarithmic variance instead of var [aQ] . 

As further justification of this choice, one can make an analogy between the 
behaviour of log a and a, etc. versus the behaviour of the Gaussian random variable 
vl of Section EH and = e^^ . In both cases the logarithmic variable is generated 
by Brownian motion, although in the log a case, there is additional drift. One sees 
from ()7.40j) that for a < (9 (1), the variances of both the logarithmic and "normal" 
variables are the same. This corresponds to short time evolution of the anharmonic 
oscillator in the above analogy. Furthermore, because of the rapid rise of variance 
with time, the range of a for which a ^ but the simulation gives any useful 
accuracy is fairly narrow and occurs in the more noisy part of the simulation. 

In summary, it can be expected that a optimization of g" in logarithmic vari- 
ances will still give good results. This is borne out by the massive improvement in 
simulation times seen in Figure FT!?! Certainly, however, some further improvement 
could be obtained by considering non-logarithmic variances (particularly since in the 
full simulation, log aVL etc. are not exactly Gaussian distributed due to the effect of 
the drift terms), although it is not clear whether the difference would be significant 
or not. 



Which phase-dependent variable to optimize 



The first-order correlation function G'*^^''(0, t) can be estimated in two ways, as seen in 
()7.50p . Hence, it is best to optimize for the average variance of the logarithm of the 
two random variables VtajS^ and (fiao/?)* corresponding to the G*-^-* estimate. Also, 
since general coherent state initial conditions to be optimized will have arbitrary 
phases, we should optimize a variance of variables related to |G'^^^(0,t)| rather than 
the complex G^^\ 
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7.5.3 Optimization of g" 

\G^^\0,t)\ is estimated by either e*^^ or e*^^, where 

Giit) = Re {log Po + log a{t) + zo{t)} (7.51a) 
GL{t) = Re{log«o + log/5(t) + zo(t)}, (7.51b) 

while the uncertainty in the estimate will be proportional to var [e*^^] or var 
Taking the considerations of the previous Section 17.5.21 into account, let us look for 
such g" = g'^p^ that the mean variance ofGi and Gl is minimized for the anharmonic 
oscillator system. This minimum occurs when 

d (var [GL(^opt)] + var (^^(topt) ) 

^ ^ = 0. (7.52) 



dg" 

Note that the optimal value g'^^^ will in general depend upon the target time topt = t 
at which the variances are considered. 

The formal solution of the gauged anharmonic oscillator equations ()7.HH|1 . as- 
suming uniform initial weight Q{0) = 1, is straightforward to find: 

riLit) = log(no)-7t + v^e-^" [<+(t)-r(t)] (7.53a) 

rriLit) = log{ao/(3o)-2i{uj-x)t-Mx f n'{t')dt' 

Jo 

+Vxe'"[^C{t)-e{t)] (7.53b) 

zo{t) = -Vxe-'" [ n"{t')[7]+{t')+zr]-{t')]dt', (7.53c) 

where using the definition ()7.27|1 . 

,fit) = '^±'-^ (7.54) 

are independent real Gaussian noises obeying 

(^^(^)>stoch = (7.55a) 

(r^±(t)r/±(t')>3,„,, = 26{t-t') (7.55b) 

{vHtmt')),,^,,= 0, (7.55c) 
and so di] = \/i{ri~ — r]~^) dt/2. Also, 

(±(t) = !\^{t')dt'. (7.56) 
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These are time-correlated Gaussian random variables. Using ()7.55|) . they obey 
the relationships 



(C^(^)>stoch 







stoch 



2 min [t, t'] 



The variables to optimize are 



0. 



(7.57a) 
(7.57b) 
(7.57c) 



Gl 
Gl 



log|no|-|t-^ |e-^"r(t) 



+ 



n"{t')r]+{t')dt'. 

(7.58) 



In the Ito calculus the noises at t' are independent of any variables at t', so one finds 



(^■f^)stoch 
(Cl) stoch 



log I no I 



2 ' 



(7.59) 



and using (j7.57b|l 



(^i) stoch 




('^i) stoch 


+ cosh 2g" 


+ 


(^i) stoch 




(^i)stoch 







X {n"it')vHt')CHt)\^^^^dt' 
Jo 

+Xe-'^" I dt' I rft"(n"(t')n"(n)3toch(^+(0^+(i")>3toch' (7-60) 



t ft 



JO 



where use has also been made of the independence of rj^it') and C^(^)- 

To further evaluate ()7.60p . one needs to calculate the averages containing n". 
Define c = y/x^~^" , and uq = u'q + iriQ. Firstly, 



{n"{t)) 



stoch 



stoch 



UqC 



(7.61) 



where we have first used the independence of the to separate the stochastic 
averages, and then applied expressions ()7.39p . ()7.44p . and fl7.47|) to evaluate them. 
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Now, since noises at times t > t' are uncorrelated with those at t' or earher, 











2K(t'))stoch, (7.62) 



using ()7.55b|) . 

Secondly, if t" > t', noting that 

where the C^(^) are independent of the C^(^)) but have the same properties ()7.57|) . 
one finds that 

(^"(^')^"(t"))stoch (7.63) 
= e-^(*'+*")(e-2^^"(*')\ (e-'^^-^*"-*')) (coscC+(t" - 

\ / stoch \ / stoch \ / stoch 



Terms containing (sincO = have aheady been discarded. Evaluating the 

\ / stoch 

averages using ()7.39|) and ()7.47|) . one obtains 

K(^')^"(t"))stoch = {l^ore^*'^^ - in',r + . (7.64) 

Due to symmetry 

'dt' fdt"n"{t')n"{t") = 2 fdt" f dt' n"{t')n"{t"), 
Jo Jo Jo 

and substituting ()7.(j4p . ()7.(j2p . and ()7.(jip into ()7.(i()|l and ()7.59p . then integrating, 

one obtains 



var [Gi(t)] 
var[Gi(t)] 



cosh(25(") 



+ 



2X< ( ) (7.65) 
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The optimum g" = g'^^^ can now in theory be calculated by imposing ()7.52|) . 

Since this would involve the solution of a transcendental equation, it would 
be cumbersome to use in a numerical simulation, going against requirement 3 of 
Section l7.5.H because one would have to execute a time-consuming algorithm at each 
time step. By considering, below, some important special cases, an approximation 
to the optimum g'^^^ applicable under the broad conditions aimed for in Section H'.B.ll 
will be found, which can then be easily used in actual simulations. 

7.5.4 Important special cases 

1. Perhaps the most important special case is when damping is absent (7 = 0). 
This is also the worst case in terms of simulation stability. At relatively short 
times one has 

4xVe"'^"^* < 1, (7.66) 

and the optimum is given (from ()7.52j) and ()7.65|) ) by the roots of the cubic 
in Vg = e~^^°'p': 

SxVl^ol + [4«)2 + 1]V/ -1 = 0. (7.67) 

In the usual case of simulations with sizeable occupation numbers in modes, 
and times up to coherence time (which then is much shorter than the quantum 
revival time vr/x) this short time condition is satisfied. A very useful expression 

g':^t-l^og{\no\./8^t). (7.68) 

applies when the Vg term is negligible. This occurs at long enough times when 
uq is large enough and mostly real: i.e. when 1 + 4(^0)^ <^ (Sx^optl^oP)^'^'^- 
So x^opt niust be at least ^ l/8|noP (a higher limit applies if 7^ 0). 

The opposite case when uq is either too small, too imaginary, or the time is 
too short has the V^^ term negligible and leads to 

9':pt ^ \ log [1 + 4«)2] . (7.69) 
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2. It can be seen that the long time behaviour of the variances depends on the 
"damping strength" parameter 



g = 27 - Axe''^ . (7.70) 

When q is positive, the long time (i.e. qt ^ 1) behaviour of ()7.65|) is asymptotic 
to a linear increase 



Xtcosh2g"-b (7.71) 



var [Gl] 
vbiIGl] 

where b = XC^^^"{[("'o)^ ~ (^o)^]/27 ~ l^-oP/o'} =F 2x^o/7 is a constant. This 



means that the sampling uncertainty grows relatively slowly, and long simu- 
lation times are possible. When q is negative, on the other hand, the long 
time behaviour is " |^ope''''YlQ'l) ^ind we expect a rapid appearance of 

intractable sampling error after some time \q\t ^ 1. 

The parameter q depends on the relative strengths of the nonlinearity and the 
damping, and determines the long time behavior of the statistical error. Note 
though that if damping is present, then for large enough diffusion gauge g", 
q can always be made positive, and the linear (in t) variance regime can be 



reached with a choice g" > log >/2x/7. 

3. An immediate extension to nonzero damping of expressions ()7.67|) . ()7.68|) and 
()7.fi9j) can be derived for times |gtopt| ^ 1 to give the cubic 

SxVl'^ol V/ + a2Vg^ -1 = (7.72a) 

where 

02(^0,7^) = l + 4(?^o) 



_ -27topt 

Jl\2 ' ' 



27^opt 

-2|„„p f i^j2V±%W)l^£:!Z:!!l) . ,r.72b) 

V 7^opt / 

The expression ()7.68p still applies at large enough and real enough no: i.e. 
when 02 <^ (Sx^optl^^oP)^'''^- In the opposite case of 02 ^ (Sx^optl^oP)^''^, one 



has 



g':^,^^logia2). (7.73) 
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7.5.5 Suggested approximate form of diffusion gauge 

Guided by conditions 2, 4, 5, and 6, in Section [7.5.11 we can see that the choice of 
5'approx f*^^ anharmonic oscillator might depend on the four real parameters n'^, 
Uq, xtopt and 7. Because of conditions 4 and 5, we are most interested in the regime 
of small 7, and either x-^opt < O (l/2^/\n^^ when |no| > C(l), or x^opt < C'(l) 
when no < O (1). These cases are covered by expression ()7.72p . 

This can be most easily seen in the two limits |r;,o| ^ 1 and {uqI ^ 1 where for 
small 7, we have gtopt ~ -2(x^opt/|'^o|)^''^ and ^ -4xtopt/ A/lT4(r?^ respectively. 
Hence the condition |g|topt ^ 1 applies for target times x^opt \no\/2^/2 and 
(1 + 2|no|)/4 respectively, which is roughly sufficient for low occupations, and 
more than sufficient for high occupations. 

To obtain an explicit estimate for g'^^^, one can either evaluate the roots of 
the polynomials ()7.67|) or ()7.72|) by standard expressions, which can be still quite 
complicated although reasonably rapid, or use the approximation 

^approx = ^ log {8|%PxV + "-T} . (7-74) 

which reduces to ()7.(j8j) and ()7.7H|1 in their limits of applicability, and works very 
well in practice (see figures 17. 6[ and l7.5|) . 

The discrepancy A between fl7.74|) and the exact optimization obtained by solv- 
ing ()7.52|) with ()7.65p is shown for real hq for a wide range of parameters in Fig- 
ure 17.21 Note that the ubiquitous e~^^" factor is, for small discrepancy, e~^^°p' ~ 
(1 — 2A)e~^^=-'pp''°''. It can be seen that for occupations > (9(10) and/or for times 
shorter than, or of the order of, singly-occupied coherence time the approximation 
is very good. 

The diffusion gauge choice ()7.74|) will be used from here on. 

7.5.6 Relationship between target time and mode occupa- 
tion 

The expression fl7. 74|) was worked out under the conditions that the mean occupation 
of the mode is conserved. In coupled-mode simulations this is no longer the case, and 
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== 0.01 
== 0.02 
== 0.05 
== 0.10 

== 0.40 



-8 -6 -4-2 2 

log(x^opt) 

Fi gure 7.2: Discrepancy A = g^pj. — .9approx between g"pj. (the exact optimization of g" 
by solving H7.52|l using H7.65|l ). and the approximate expression H7.74|l . Displayed is the case 
of no damping (7 ~ 0) and classical initial occupation (no = Uq), shown as a function of topt 
and no. Discrepancy values A are shown as solid CONTOURS with spacing 0.05. Additional 
DASHED CONTOURS shown at very low discrepancy. Dotted line approximates region of greatest 
discrepancy xtopt ~ I/Stt-q^. For comparison, several physical timescales are also shown in GREY: 
time of first quantum revival ^revival, phase coherence time icoh and phase oscillation period iosc- 

could be adapted for by replacing uq by n(t), which explicitly assumes independence 
of trajectories, and a Markovian process. Although the mean over all trajectories 
{Q{t)h{t)) would be a better estimator of the mean boson occupation, this would be 
in conflict with aim 5 in Section [7.5.11 and might lead to biases due to complicated 
feedback mechanisms between trajectories. 

Another assumption used to arrive at ()7.74|1 was that g" would be constant in 
time, which is now no longer the case. This raises the question of how to include 
the target time in g"^^^. Two ways that quickly come to mind is either to calculate 

1. Always optimizing for a time topt forward from the present t (choosing then, 
explicitly, ^opt(V))' o"^- 

2. Optimizing for only the remaining time to the absolute target time topt > t 
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(choosing, then, 5("pt(max[topt - ^,0])). 

In combination, this gives rise to the four possible gauge forms (in the ()7.74|) 
approximation, using ()7.72b|) ) 

^approxW = ^log{8|rio|Vopt + a2(%,7W)''''}. (7- 75a) 

fl'approx(^) = ^log {8|no|\trem + a2(n0,7i^rem)^^^} , (7.75b) 

^approxW = ^l0g{8|fi(t)PxV + «2(^(t),7V)'^'}, (7.75c) 

^7approx = ^l0g{8|fi(t)|2xt,em + a2(n(t),7trem)'/'}(t), (7.75d) 

where the "remaining time to target" is 

trem = max[topt -t,0]. (7.76) 

These strategies have been numerically investigated for the undamped (7 = 0) 

case with real no starting conditions. It turns out that form ()7.75d|) has some 
advantage over the others, particularly at high occupations. Details are discussed 
in Section 17.7.11 and shown in Table 17.11 



7.5.7 Boundary term issues 

When considering the use of an adaptive g"{t,h{t)), one should be careful that the 
g" dependent terms do not introduce new noise divergences that were not present 
when the standard g" = square root noise matrix Bq was used^. The condition 
to avoid noise divergence symptoms (and so, presumably, boundary term errors) is 
as always The stochastic equations should contain no radial component that 

grows faster than exponentially as large radial values of the phase-space variables 
are reached. 

Since the suggested forms ()7.75|) depend directly only on the complex occupation 
variable n = e"^, and not on either of the other independent variables {rriL, and zq), 
then it suffices to only check that the n evolution equation contains no radial super- 
exponential growth. This is because the g" dependent terms in the mi and Zq 

■^Incidentally, any boundary term errors introduced by such noise divergences would have to be 
of the second kind, since the choice of diffusion gauge is made at the level of the FPE-stochastic 
equation correspondence, long after any boundary terms B has been discarded. 
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evolutions simply accumulate integrals of functions oi ul, and if n remains finite, 
then so will the other variables. 

The n evolution is now 



Provided the time dependence of parameters 7, nbath, ^, or x is not pathological, the 
condition ()6.3|) means g" must obey 



for no super-exponential growth to occur. That is, must grow as a non-negative 
power law (or faster) as |n| becomes large. This is seen to be satisfied by the sug- 
gested forms ()7.75|) . Finally, non (yf"-dependent terms were considered in Section EHl 
and found not to lead to any moving singularities. 

It is concluded, then, that the diffusion gauge form ()7.74|1 does not lead to any 
new noise divergences or moving singularities, and hence none of the usual boundary 
term error symptoms have been reintroduced. 

7.5.8 Particle gain 

External particle gain (rather than loss due to a zero-temperature heat bath) com- 
plicates the behaviour. Looking at ()7.77|1 . the bath interactions tend to equilibrate 
the mode occupation to the bath value nbath- If ?^bath (n), then one can expect 
that the optimum gauge calculated with nonzero 7 will be largely unchanged. For 
highly occupied bath modes, or strong coherent particle gain e, a new analysis of g" 
optimization may give better results that ()7.75|1 . however this appears to be a very 
involved procedure. An obvious first try is to just see what happens with the gauge 
form (TTTHl) . 




+ {edt + A/7^bath (i^^bath)/? + {edt+ A/7nb^c?^bath)*a- 



(7.77) 



lim e ^ oc |n(t)|" where a < 0. 

|ri(t)|^oo 



(7.78) 
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7.6 Estimates of simulation times 

The limit ()7.43|) on the variance of exponentials of Gaussian random variables can 
be used to estimate times of useful simulation tsim from expressions for var[G2,] 
— since Gl behaves much like a Gaussian random variable in analogy to a^, and 
observables are estimated by e*^-^ in analogy to oc e°"^. Imposing 

(var [GL(tsim)] + var[Gz.(tsim)])/2 ^ 10, (7.79) 

one can solve for tsim to obtain at estimate at least of the scaling of tsim with system 
parameters. 

Consider first the gauged simulation. For coherent state initial conditions /3o = 
at small damping 7^ -C 1, and short enough times \q\t <^ 1, 

var [Gl] = ^(v, + ^]+ 2{xV,tn',)'. (7.80) 



2 V ' V,, 

For large particle number tiq 1, g"^^ takes the form ()7.(j8|l . so ~ l/2{n'lx^V^^^ 
the X'tVg/2 term is negligible, and var [Gl] ~ 3(noX^t^)^^^/2, and ()7.79|) leads to a 
useful simulation time 

tsim ~ ^ ~ O (10 tcoh) . (7.81) 

Since in this regime \q\t ~ AxtVg ~ 2(xt/'^o)^''^, then |g|tsim ~ C (5) /ng ^ 1, and so 
the expression 1)7.811) is consistent with the original short time assumption \q\t <^ 1 
when n'o > C (5). 

At small occupations <^ 1 on the other hand, g'^^^^ —>■ 0, and at long times, 
var [Gl] ~ ^nQ^e^^*. Thus useful simulation time scales very slowly with n^, and is 



0(l)-ilogn[, 



(7.82) 



X 

The "long time" condition holds while ^n'o^e^^* is much greater than the lower order 
term xt- That is, n'^ > O (10"^). At even smaller n^, the xt term dominates ()7.8())1 . 
and 

tsim~0(lO)/x. (7.83) 
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For comparison, consider the positive P behaviour, which can be estimated 
from the logarithmic variances ()7.92j) for the case with no drift gauge, which are 
worked out in Section [7.8.2I For coherent state initial conditions no = and small 
damping 'yt 1, 



var [Gl] ^Xt + 2n',{xtf + 2x^n',^ ~ 2^ ~ ^ J 



(7.84) 



At short times Axt ^ 1 the leading term in the {•} expression is |(x^)^^o^- In such 
a case, for large mode occupation tiq ^ (9 (1) /xt this term dominates, and using 
()7.79p . one obtains 

Oil) 



(7.85) 



Checking back, 1/xt ^ (9(l)nQ^/^, so ()7.85j) is consistent with the short time as- 
sumption for ?2q ^ 1. At long times, one again has ()7.82p and ()7.83p . The expression 
()7.85p is also useful in estimating simulation times for many-mode models, as will 
be seen in Section HO. 2. II 

Curiously, the "stable" drift-gauged simulation with no diffusion gauge does 
even worse than the positive P. In this case, Vg = 1, and using ()7.80|) . at large tiq 

O (2) 

^sim ~ ;~ ~ ^osc; (7.86) 

which is consistent when rig ^ (9 (|). 

The numerical results in Table mi and Figure ITT^ are seen to agree with these 
estimates to within constant factors of O (1). 



7.7 Numerical investigation of improvement 

To unambiguously determine the improvements in simulation time that are achieved 
by the use of the proposed gauges, simulations of an undamped anharmonic oscillator 
were carried out for a wide range of mode occupations and a variety of gauges, 
testing various target times topt where appropriate. (The undamped system was 
chosen because this is the worst case.) 

Figure [721 compares ^im? the maximum time achieved with various methods at 
which useful precision in the phase correlations G'''^^(0,t) can be obtained. Note 
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the logarithmic scale. Results at high boson occupation are tabulated in Table WA\ 
which includes data for a larger set of gauge choices. Figure 17.61 gives examples 
of calculated values |G'^^^(0,t)| along with error estimates. Table 17^ gives some 
empirical fitting parameters to expression ()7.89|) for the useful simulation time tsim- 
These may be useful to assess simulation times when the particle occupation is of 
O (10) or smaller, and the expressions ()7.8H) or ()7.85|1 are not accurate. 

Details of the sampling error behaviour are shown in Figure 17.51 for a range of 
gauge choices, while Figure 17.41 shows the dependence of useful simulation times on 
the target time parameter topt for a variety of gauge forms for two example mode 
occupations: no = 1 and no = 10''. 

7.7.1 Procedure 

Simulations were carried out with coherent state initial conditions for a wide range 
of mean occupation number 

no = {10-^ 10-^ 10-^ O.Ol, O.l, l, lO, 100, 1000, 10^ 10^ 10^ 10^ 10^°}. (7.87) 

The following gauges, and their varieties as seen in Table 17.11 were tried for each no 
value: 

i) No gauge: Qk = g" = 0. This is the standard positive P distribution technique. 

ii) Drift gauge only: Qk given by ()7.30|) . and g" = 0. 

iii) Both drift gauge ()7.30|) and diffusion gauges of the four related forms ()7.75|) . 

iv) Diffusion gauge ()7.100p only, as described by Plimak et a/j2] (See Sec- 
tion EHSI). 

v) Drift gauge (j7.99|l only, as described by Carusotto et aim (^^^ Section 17.9.11) 

vi) Adaptive diffusion gauges ()7.94j) only (of the four forms ()7.97|l ). and Qk = 0. 
For the diffusion gauges of iii), iv), and vi), which depend on an a priori target 
time parameter topt, a wide variety of target times were tried to investigate the 
dependence between tsim and topt, and ascertain what are the longest simulation 
times achievable. The dependence of tsim on topt is plotted in Figure 17.41 while the 
best times are tabulated in Tables 17.11 17.21 and Figure 17.31 
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The term useful precision has been taken to indicate a situation where the es- 
timate O of the expectation value of some observable O is known to be precise 
to at least one significant digit when S = 10^ trajectories are used. That is, 
AO{S = 10®) < |O|/10. Since for many-mode systems, the calculation of even 
one trajectory is reasonably time consuming, it is clear that S = 10® trajectories 
is very unlikely to be exceeded for most non-trivial problems. Hence, meaningful 
results for non-trivial problems are at best likely to be obtained only in the param- 
eter region where the above-defined "useful" precision condition is satisfied. If one 
calculates some lesser (than 10®) number of trajectories, yet 5^1, then the Central 
Limit theorem can be used to extrapolate AO to the situation when S = 10®, since 
AO oc 1/VS. This gives finally, that the precision is taken to be "useful" when 



"simulation time" tsim that will be referred to here is the maximum time at which 
useful precision in (0, t) / G^^^ (0, 0) | is retained. Each actual simulation was done 
with S = 10^ trajectories. 

Uncertainties in the calculated useful simulated times arise because the AIG^^-*! 
are themselves estimated from the finite ensemble of S trajectories. The uncertainty 
in AIG*-^-*! was estimated by inspection of several (usually 10) independent runs with 
identical parameters. The range of values of tgim seen was taken to be twice the 
uncertainty in tsim- 

The scalings of simulation time with particle number for Uq ^ 1 and ^ 1 
have been worked out in Section 17.61 Taking these results into account. Table E21 
empirically fits the simulation time tgim to 



for intermediate times, as this may be useful for evaluation of simulation strategies in 
a many-mode case, cq characterizes the power-law scaling of tgim at high occupation, 
Ci the pre-factor for high Uq, C3 a constant residual tgim at near vacuum, C4 charac- 
terizes the curvature at small n'p, while C2 is related to the stiffness of the transition. 




(7.88) 



In the current simulations, the observable in question is |G^^^(0,t)|, and the 




(7.89) 
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Figure 7.3: MEiximum useful simulation time tsim, of the one-mode undamped anharmonic 
oscillator with various gauge choices. Initial coherent state mean mode occupations uq — Uq. 
Width of plotted lines shows estimated uncertainty in the values shown. The drift gauge is 
(|7.30|l . while the diffusion gauge is (|7.1UU|I of Phmak et al 2 when on its own, or H7.75dp when 
with the drift gauge H7.3()(l . For comparison, several timescales from Section r7.2.3l are also shown as 
broken lines: time of first quantum revival irovivai, phase coherence time icoh and phase oscillation 
period 



The expression fl7.89|l reduces to cinQ~''^°/x and (cs — c^logn'oJ/x, when ^ 1 
and u'q 1, respectively, in agreement with the limiting expressions fj7.81|) and 
()7.82|) . Uncertainty Acj in parameters Cj was worked out by requiring X]no{['^cst(cj =t 
Acj,no)-tsira{no)]/AtsiraV = Zlno { ^ + ( t^e^t (cj , Wo) - tsim (?^o)] / Atsim)^} • lu the range 
checked {uq G [10~^, 10^°]), the fit is good — i.e. there are no outher data that would 
lie significantly beyond the range of test specified by parameters Cj ± Acj. 
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Table 7.1: MELximum simulation time, at useful precision, of the one-mode system in the 
hmit (no = Uq) ^ 1, achievable with various gauge choices. Calculations are for the undamped 
(7 = 0) system, which is the worst case in terms of sampling error. More details are given in 



Section [7.7.11 For comparison, several timescales from Section [7.2.31 are also given: time of first 
quantum revival ^revival, phase coherence time tcoh and phase oscillation period ^osc- 



Drift gauge Gk 


Diffusion gauge g" 


Useful simulation time 
tsim when Uq = Uq':^ 1 


Maximum no 
for which x^sim > 1 




tosc 











(1.06 ± 0.16) tosc 


0.014 t 'Zt 







(1.7 ± 0.4) W 


0.08 1 








(1.27 ±0.08) /xn'o^^^ 


0.11 ±0.06 




^coh 










(8.2±0.4)teoh 







or (!V.9Va,|) or (!V.9Vc|) 


12 ±3 





(I7.97b|) or (|7.97dD 


(10.4 ± 0.7) W 


19 ±4 




(!7.75h|) 


(25.6 ± 1.0) teoh 


120 ± 30 




or.rnai 


(30 ± 3) tcoh 


150 ± 40 


fl7:T?ni) 


(|7.7ncD 


(32 ± 3) tcoh 


190 ± 15 




(|7.75cH) 


(35 ± 4) teoh 


240 ± 70 




^revival 







Table 7.2: Empirical fitting parameters for maximum useful simulation time isim with 
several gauge choices. The fit is to expression (17.89(1 . 





Positive P 


Both gauges 


Drift gauge 


Diffusion gauge 


Drift Gauge 












Diffusion Gauge 





(!7.75d|) 





(!7.97H| 


Co 


2/3 


1/2 


1 


1/2 


Cl 


1.27 ± 0.08 


17.6 ± 1.7 


5.5 ± 1.2 


5.2 ± 0.4 


C2 


3.2 ± - 


3.6 ± ^3 


^■^ =■= 0.4 


7 _|_ 00 
^. ' =■= 1.0 


C3 


-0.5 ± 0.3 


2.8 ± 0.9 


-0.5 ± 0.3 


-2.4 ± 0.6 


C4 


0.45 ± 0.07 


0.23 ± 0.13 


0.49 ± 0.08 


0.23 ± 0.13 
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Figure 7.4: Comparison of a 'priori target time topt with actual useful simulation time 

tsim for a variety of diffusion gauges: Results for drift gauges (|7.^^Q(I with the four g" forms (17.75(1 
are shown as: (|7.75d|l - solid dark; (|7.75c(I - dash-dotted light; ((7.75b|l - dashed; (|7.75a|l 
- dash-dotted dark. Relationship obtained using the diffusion gauge (|7.100() of Phmak et aZ|2] 
(but no drift gauge) is also shown (solid light). The dashed line in the background shows, for 
reference iopt = ^sim. Subplot (a): mean particle number no = 10'^, (b): no = 1. 
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Figure 7.5: Uncertainty in \G'^^^0,t)\, as a function of time. The quantity plotted is A 



prcc 



v/5/106(10 A|G(i)(0, t)\ I |G(i)(0, 0)1), so that Ap,.oc < 1 corresponds to useful precision as defined 
in Section [V. 7. II Results are plotted for combined drift 1)7.30(1 and diffusion 1(7.75(1 gauges, where 
the four forms are shown as : l(7.75d(l - solid dark; ((7.75c(I - dash-dotted light; l(7.75b(l - 
DASHED; l(7.75a(l - dash-dotted dark. Data for the diffusion gauge l(7.100() of Plimak et aZ[2] 
with topt ~ 3tcoh as used therein is shown as a light solid line, while the ungauged positive P 
calculation is shown as a dotted line. Simulations were carried out with S = 10* trajectories, 
starting with initial coherent state conditions: (a) uq = lO"' (b) no = 1. The combined drift-and- 
diffusion-gauge plots were calculated with target times of: (a) topt = SO^coh (b) topt — 'Hcoh- 
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Figure 7.6: Modulus of the phase correlation function G'^^^(0, i). Comparison of calcu- 
lations with different gauges: Subplots (a) and (d): Ungauged positive P. Subplots (b) and (e): 
Diffusion gauge (|7.1UU|I of Plimak et aZ|2] with topt = 3icoh- Subplots (c) and (f): Combined 
drift and diffusion gauges (|7.30|l and l)7.75d(l with the choice iopt — 20icoh The initial conditions 
were a coherent state with (n) — uq, where in subplots (a)-(c): uq — 10^ particles, and (d)-(f): 
uq ~ 1. Triple Solid lines indicate G*^^-' estimate with error bars. Exact values are also shown 
(single DASHED line). The quantum revival time is shown shaded for the no = I plots. S = 10"* 
trajectories in all cases 



7.7.2 Features seen 

• Combining drift and diffusion gauges gives tlie longest useful simulation times. 
Such simulations give good precision well beyond the point at which all coher- 
ence has decayed away for highly-occupied modes - potentially up to about 35 
coherence times. 

• Diffusion-gauge-only simulations also give quite good statistical behaviour (al- 
though useful simulation times are about 4 times shorter at high occupation 
than with both gauges). 
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• Despite the efficient behaviour of combined gauge simulations, using only a 
drift-gauge gives even worse statistical error than no gauge at all (although, 
this does eliminate potential boundary term systematics in a multi-mode sys- 
tem, which are a problem with no gauge). Such simulations are restricted in 
time to about one phase oscillation. 

• Plain positive P simulations at high occupation last for about tiq^/^ phase 
oscillation periods, which is much less that the time required for significant 
coherence to be lost if no is large. 

• The diffusion gauge forms ()7.75b|) and ()7.75d|) that optimize for the "remaining 
time to target" (topt — t) show statistical errors that are well controlled by 
the choice of the target time topt- Basically statistical error can be reliably 
expected to remain small up to the explicit target time topt, provided that 
this is within the useful simulation range given in Table mi — see Figure ESI 
Thus, a heuristic approach to get the maximum time out of a simulation would 
be to 

1. Pick topt to be some time that one wants to be able to simulate to. 

2. Run a simulation, and see how long useful observable estimates occur: 

^sim(^opt) • 

3. Choose: 

(a) If tsim(^opt) < ^opt then reduce topt to some value between tsim(^opt) 
and the present topt- Using this new value should give a new better 
simulation time also in that range. Iterate back to step 2. Or, 

(b) if tsim(''^opt) ~ topt then either keep the simulation if one is happy with 
it, or one can try to increase topt to perhaps obtain more simulation 
time, going back to step 2. 

The rest of the diffusion gauge forms ()7.75j) do not follow such a simple de- 
pendence and appear to require tedious parameter searching to find the best 
gauge parameter choice for given initial conditions. 
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• Simulations showing quantum revivals have not been achieved. Generally it 
appears that useful simulation times are proportional to coherence times at 
high occupations. In this context, however, it is worth mentioning that quan- 
tum revivals have been seen in this system by Dowling|69j using the gauge P 
representation, by imposing externally-induced time reversal in the mode at 
a time treversai, whcrc tcoh < ^reversal <^ ^revival- The phase cohcreucc was seen 
by Dowling[69J to decay and then revive at t = 2ti.eversai- This indicates that 
the potential for quantum revivals in these stochastic simulations is there, just 
not realized in the free evolution analyzed in this chapter. 

• At low occupation, i.e. of the order of one boson or less, combined-gauge meth- 
ods still give the best results, but the magnitude of their advantage becomes 
smaller. 

• Of the four gauge forms ()7.75|) . the form ()7.75d|) gives the longest useful sim- 
ulations. 

• The diffusion gauge forms (j7.75dj) and ()7.75cj) optimizing depending on the 
occupation of the current trajectory do better than those optimizing on the 
basis of only the initial condition. This is not surprising, since this approach is 
simply better tailored to the predicted subsequent evolution of each trajectory. 

• At low occupations, a broad range of topt choices give much the same statistical 
behaviour. This is most likely because the forms ()7.75|1 are not always the best 
optimizations of ()7.fj5j) in this regime for two reasons: 1) Because the "small 
time" condition \q\t -C 1 is not always satisfied at the long time end of the 
simulation, and 2) because as seen in Figure 17. 2| the form ()7.74|1 is not an 
accurate root of the cubic ()7.72|1 . 

• The simulation times with diffusion gauges (whether accompanied by drift 
gauge ()7.3Up or not) not only have better scaling with uq when tiq is large, but 
this power- law decay of simulation time starts much later, as seen in Figure [721 
and the right hand column of Table 17.11 
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7.8 Diffusion gauges on their own 
7.8.1 Motivation 

The previous Sections 17. 51 to n~7\ considered optimization of the diffusion gauge given 
that instabihties in the equations have been removed using the drift gauge ()7.H()|1 . 
An approach that could be an ahernative is to only use a diffusion gauge g" to make 
a tradeoff between noise in the number and phase variables as before, but without 
introducing any drift gauge. 

Such an approach could have the advantage that no weight spread is introduced 
since dz^ = 0. The weight spread is not a major issue in a single-mode system, and 
clearly from Table mi the drift-gauged simulations last longer, but with many modes 
there may be a problem: All the gauges introduce modifications to the same single 
weight variable. A vivid example occurs if we have M identical but independent 
modes. Then, dzo = ^^1=1 GkdWk, and the standard deviation of dzo is y/M times 
greater than for a single mode but with no new physics. Since the log-weight zq 
enters into the observable estimates in exponential fashion, the situation is again 
analogous to the average of an exponential of a Gaussian random variable, and there 
is the non-scalable useful simulation limit var [zq] < O (10) by condition ()7.43p . Since 
var [zo(t)] ~ OC Mt at short times, this translates to 

tsim OC ^ (7.90) 

in a worst case. 

The major disadvantage of the Q = approach, is that boundary term errors 
cannot be ruled out, as in the plain positive P method. One can, however, try to 
monitor for their appearance using the indicators developed by Gilchrist et al |64j : 
Spiking in moment calculations, rapid onset of statistical error after some time, 
or power-law distribution tails. A caveat is that these indicators tend to emerge 
only for fairly large ensemble sizes S (Typically what happens is that the indicators 
appear with a certain delay time if S small). 

This Q = method variant, although less desirable due to the need for care- 
ful monitoring of the simulation, was found to be much more successful than the 
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drift-gauged method (in its present form) at increasing simulation times for the 
prehminary example many-mode models simulated in Chapter E3 



7.8.2 Optimization of g" and comments 



Proceeding as before in Section 17. 5 .31 the formal solution of the anharmonic oscillator 
equations with fi(0) = 1 are as ()7.53|) . but with the changes 

TTiL = d733B|) +4y [ n"{t' 

Jo 

zo{t) 



dt 



■ 0. 

Following the same procedure as before, one obtains 

var [Gl] 
varfGi] 



(7.91a) 
(7.91b) 



Xtcosh(2/)-4x' n'o 





















+ 


11 



1 -e-T*(l + 7t) 



r 



+4x>o| 



-1^-1 1 - e"^* 1 /I - e"^* 



(7.92) 



q{q--f) 7(7-?) 2 V 7 
where the "damping strength" q is given by ()7.7()|) . The variational condition ()7.52j) 
can now be used to optimize g". 

Considering the same special cases as in Section 17.5. 4| 

1. With damping absent, and at relatively short times \q\t <^ 1, the optimized 
gauge is 



9opt ~ S'approx — . log 



(no cubic polynomial this time. 



-{4xtopt\no\f + I 



(7.93) 



2. The long time behaviour when \q\t ^ 1: When g > 0, the increase in the 
variances of Gl and Gl is still linear just like in ()7.7H) . but with the constant 
b = 4x^|^oP(?/27 — l)/g7 — 4%^ [rig =f raQe"^^"]/7^. When g < the long time 
behaviour is again exponential: Gl ~ 4x^|nopel''l*(g/27 — l)/'jq. 

3. For nonzero damping in the |gtopt| 1 regime, the optimized diffusion gauge 



IS 



fi'opt ~ fi'approx — ^ loj 



{4:Xtopt\no\f 



C2(7^opt) + 1 



(7.94a) 
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-4 -2 
log(x^opt) 

Fi gure 7.7: Discrepancy A = g^pj. — .9approx between g"pj. (the exact optimization of g" 
obtained by solving (|7.52(l using (|7.92(l 'l and the approximate expression (I7.94|l . Displayed is the 
case of no damping (7 = 0) and classical initial occupation (tiq = ?^o), shown as a function of 
topt and no- Discrepancy values A are shown as solid CONTOURS with spacing 0.05. Additional 
DASHED CONTOURS shown at very low discrepancy. For comparison, several physical timescales 
are also shown in GREY: time of first quantum revival trevivai, phase coherence time tcoh and phase 
oscillation period tosc- 



where the coefficient is 



C2iv) 



3 /e-2^(3 + 2t;) + l-4e~ 



2 \ 

which reduces to 02(0) = 1 in the undamped case. 



(7.94b) 



The discrepancy between fl7.94|) and the exact optimization obtained by solv- 
ing ()7.52|) with ()7.92j) is shown for real uq for a wide range of parameters in Fig- 
ure It can be seen that for occupations > O (10) and/or for times shorter than 
singly-occupied coherence time 1/2%, the approximation is good. Compare with the 
analogous case with drift gauge shown in Figure UT^ 

For zero damping, and coherent state initial conditions = Uq, and relatively 
short times 4xte~^^" <^ 1 one finds 



var[Gi(t)] =var[Gz.(t)] =xtcosh2/-2(xt)X + -Xt(^[,)V25". 



(7.95) 
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At large occupation u'q, the gauge ()7.94|1 gives e^" ~ (4xtopt^o)^/3, and one finds 

that all three terms in ()7.95j) are of similar size. Using the condition ()7.4H|1 the 
simulation time is 

tsim~O(10)tcoh. (7.96) 

The 4xte~^^=-'pp'^°'' <^ 1 condition for t < topt implies large occupation Uq ^ O (1). At 
low occupation on the other hand, e^"pp'^°'' 1, the variances are ~ e^^^n'o^/A, and 
one has again ()7.82|) 



7.8.3 Numerical investigation of performance 

The results of numerical simulations using such diffusion gauge only schemes are 
shown in Figures I77H1 and I7.9[ in analogy with Fi gures 17^ and 1731 for the drift gauged 
schemes. Some data is also given in Tables mi and 17^ As before, simulations were 
made with S = 10^ for the mode occupations ()7.87|) . and a wide range of target 
times topt- 

The four gauges simulated were all of the general form ()7.94|) . but with the four 
adaptive varieties forms 

9"it) = ^7l'pprox(^o,V) (7-97a) 

9"it) = ^7l'pprox(^0,trcm) (7.97b) 
9"{t) = ^approx(^(^)> V) (7.97c) 

9"{t) = 9:r,pro.im,trem) (7.97d) 

in analogy with ()7.75|) . where trem is given by ()7.76|) . Otherwise, the procedure was 
the same as described in Section ITTI Additional features beyond what is mentioned 
there include: 

• The diffusion-gauge-only simulations give improvement over the positive P, 
but give simulation times O (4) times shorter than when combined with drift 
gauge ()7.30|) . For example, compare Figure FTHl to Figure 17^ . tsim is still 
^ tcoh at large uq, however. 
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Figure 7.8: Comparison of a priori target time topt with actual useful simulation time 

tsim for a variety of diffusion gauges in the G = schemes: Results for the four forms (|7.97|) are 
shown as: (|7.97d(l ~ solid dark; (|7.97c|I - dash-dotted light; l|7.97b|l - dashed; (|7.97a|l - 
DASH-DOTTED DARK. Relationship obtained using the diffusion gauge H7.100() of Phmak et al^ 
(but no drift gauge) is also shown (solid light). The dashed line in the background shows, for 
reference iopt = ^sim- For the gauge l)7.97d|l . the whole region where useful precision occurs is 
shown by the dotted line. Subplot (a): mean particle number uq — 10"*, (b): no = 1- 



0.5 




0.5 



(b)no = l 




Figure 7.9: Uncertainty in \G''^\0,t)\, as a function of time for various Q — schemes. The 



quantity plotted is Aprcc = ^/S/W{10 A\G^'^'> {0,t)\ / |G(i)(0, 0)|), so that Ap,.cc < 1 corresponds 
to useful precision as defined in Section r7.7.1l Results are plotted for the four diffusion gauge forms 
(|7.97|l . shown as : (|7.97d|) - solid dark; H7.97c|l - dash-dotted light; l)7.97b|l - dashed; (|7.97a() 
- DASH-DOTTED DARK. Data for the diffusion gauge H7.1()0|l of Phmak et al 2 with ^opt — S^coh 
(as used therein) is shown as a light solid line, while the ungauged positive P calculation is 
shown as a dotted line. Simulations were carried out with S = 10"^ trajectories, starting with 
initial coherent state conditions: (a) hq — 10^ (b) tiq = 1. The diffusion-gauge plots in (a) were 
calculated with target times of iopt = lOicoh for forms Ij7.97d|l and (|7.97b|l . and iopt = 7tcoh for 
forms H7.97c|l and (j7.97a|l . In subplot (b), the four forms (j7.97|l were simulated with iopt — Stcoh- 
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• The full adaptive gauge form (|7.97dj) gives the most predictable results, in 
analogy to the drift-gauged case. That is, tsim ~ ^opt for times up till those 
given in Table mi 

• At low Uq the simulation time appears more sensitive to the choice of topt than 
for the drift-gauged simulations. 

• At high occupations ng ^ 1, the pairs of adaptive gauges ()7.97dj) fc ()7.97bj) and 
()7.97a|) fc ()7.97c|l behave identically, and are thus not shown in all plots. The 
simulation appears to be insensitive to whether one uses a gauge choice depen- 
dent on h{t) or nQ. This is probably a feature peculiar to the particle number 
conserving single- mode anharmonic oscillator model, because here h{t) ^ no 
while useful precision is seen. 

• For Uq ^ 1, the time- adaptive gauge forms (j7.97bjl and (j7.97djl lead to a 
peculiar effect if the optimization time topt is chosen larger than the usual 
maximum tsim given in Table 17.11 The statistical error in the G*-^-* estimate 
first rises rapidly, then falls again, and finally grows definitively. This is seen 
in Figure rTTUf a). and the parameter region for which this occurs is shown in 
Figure [7]HK a). In effect one has two time intervals when the simulation gives 
useful results: at short times, and later in a time interval around t O (lOtcoh)- 

7.9 Comparison to recent related work 

Improvements to the basic positive P simulation method for specific cases of in- 
teracting Bose gas systems have been tried with some success in several recent 
pubhcationsjUEllSlEn!- Here we tie these together with the stochastic gauge formal- 
ism, and make some comparison to the results and analysis in the present chapter. 

7.9.1 The work |[T] of Carusotto, Castin, and Dalibard 

The article N-hoson time- dependent problem: A reformulation with stochastic wave- 
functions considered an isolated (i.e. particle-conserving) system of interacting 
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bosons. The "coherent state simple scheme" described in Section III B 2 therein 
can be identified as using drift gauges of the form 

On = {anPn - Ittn^) (7.98a) 

= V2v({anPn-\Pn\') (7.98b) 

when re- written in the notation of this thesis for a many-mode system as in equations 
()5.17p . with mode labels n. For the single- mode system considered in this chapter, 
this corresponds to 

Gi = i^/2i^ {n - \a\'^) (7.99a) 
G2 = y^{n-\f3\'). (7.99b) 

This gauge causes a full decoupling of the complementary a and /5 equations. Like 
()7.H0j) it is also successful in removing moving singularities, since the nonlinear terms 
in the radial equations for d\a\ and d\l3\ are removed. 

There are two major differences between the coherent state wavefunction method 
and the gauge P method: Firstly, the former is hardwired to models that conserve 
particle number. This hardwiring to particles has both a major benefit and a 
disadvantage, as compared to the gauge P method considered in this thesis: 

• Benefit: coherent state amplitudes are not free to explore the entire phase 
space, are bounded from above, and cannot form moving singularities. 

• Disadvantage: Losses or gains from external reservoirs are unable to be 
simulated, and so it is not applicable in its present form to simulations of e.g. 
evaporative cooling or a continuously loaded system such as an atom laser. 

Another major difference is that diffusion gauges were not considered by Carusotto 
et al, and hence simulation times with this method for a single mode were very 
short (^ tosc)- 

The gauges ()7.99|1 could also be applied to the present gauge P method, and their 
efficiency can be compared with the efficiency of gauges developed here: fl7.30|) when 
g" = 0. The gauge ()7.99|) mediates the replacements h — > \a\^ or in the da and 
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Figure 7.10: Spread in trajectory weights ((a) and (c)) and phase correlation function 
uncertainties ((b) and (d)), compared for the drift (only) gauges (|7.99() - dashed, and (|7.30() 
(with no diffusion gauge: g" — 0)- solid, in a one-mode, undamped, gainless system. Coherent 
state initial conditions with initial mean occupations (a) and (b): no ~ 10^, while in (c) and (d): 
no = 1. All calculations arc with 10'' trajectories. 



dp equations (respectively), as opposed to ()7.30p . which only replaces h — > Re{n} 
and does not decouple d/S from da. So, since the magnitude of the spread in fl 
behaves approximately proportional to ^/J2k l^fcP^ ^^e gauge ()7.99|1 is expected to 
produce a broader distribution of weights. Numerical simulations were carried out, 
and the results are shown in Figure 17. 101 for both high and low mode occupations, , 
and Table 17.11 for high occupation. Useful simulation time with ()7.99|1 is somewhat 
smaller than seen with ()7.H0j) . 

In the same published workpQ, a stochastic wavefunction method was developed 
based on Fock number states as well. This method was shown to give good results 
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and not be prone to boundary term errors but unfortunately does not appear ex- 
tensible to open systems in any straightforward fashion because it is very strongly 
hardwired to total particles. 

The stochastic wavefunction method was also adapted in to some cases of non- 
local interactions, arriving at equations that use effectively a similar noise expansion 
to that in ()5.46|) . but were only applicable to potentials having exactly real Fourier 
transforms Un = U~. 

7.9.2 The work [J] of Plimak, Olsen, and Collett 

The article Optimization of the positive-P representation for the anharmonic oscil- 
lator considered a single-mode undamped, gainless system at high Bose occupation 
with coherent state initial conditions. The "noise optimization" scheme applied 
therein to greatly improve simulation times can be identified as an imaginary diffu- 
sion gauge of the form (rewritten in the present notation) 

5^12 = - cosh"^ [2noxtopt] = ^9 a (7.100) 

defined at high occupation or long times (i.e. while 2noX^opt > !)• This is dependent 
on a target time topt (which was taken to be topt = Stcoh in the calculations of 
Ref. j2j), and the initial Bose occupation no = = |aoP- The value of the gauge 
was held constant throughout the calculation, without allowance for a changing 
mode occupation. 

The useful simulation times obtainable with this method are also shown in Fig- 
ures 17.31 and 17.51 and Table 17.11 Their precise dependence on the target time pa- 
rameter topt has been calculated here, and is shown in Figure 17.41 

Comparing ()7.100p with the optimized diffusion gauge ()7.97a|) that is constant 
with time, one finds that in the 1 limit the two gauges differ by a constant: 

g'kino, topt) ~ ^approx(^0, ^opt) + (7.101) 

While the simulation times achievable using g'^ are comparable with those obtained 
with the adaptive gauge fi'approx(^(^)' ^opt) of ()7.97ap . the target time topt is no longer 
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a good indicator, and has a complicated relationship with tsim, as seen in Figure 17781 
The simulation time is also much shorter than with the drift gauged simulation. 

Extensions to multi-mode systems or low mode occupations were not considered, 
however since the system under consideration was only a single mode, the lack of 
drift gauges did not lead to any boundary term errors in this particular case. 

7.9.3 The work [31 of Deuar and Drummond 

Some initial research by Drummond and Deuar into gauges for the single-mode 
undamped anharmonic oscillator system was reported in ^3]. Here the state was 
written as an expansion over a normalized, Hermitian, coherent state projector 
kernel 

^' = 2e"'cos(^ + n") ' ^^-'^'^ 

where the phase variable 6 was real. This is equivalent to imposing Hermiticity on 

the positive P kernel before normalizing it. 

Drift gauges of the form 

=- X^[n" -n' + \a\'^] [tan{e + n")+i] (7.103a) 
G2 = X,/^[n" + n' -\f3\^] [ta.n{e + n")+t] (7.103b) 

were used, where A is a (real) strength parameter chosen a priori. 

Simulation precision was greatly improved, but evident boundary term errors 
were present. This may have been caused by several factors, which are pointed 
out here because they are good examples of what should be avoided when choosing 
kernels and gauges: 

1. The radial super-exponential behavior of da and d(3 is not removed in full, 
hence one can expect that moving singularities will still be present — as is 
ultimately seen as boundary term errors. 

2. The magnitude of the kernel ()7.102j) diverges when 6 —n". Hence, this 
parameter region might be unable to be sampled in an unbiased way. In 
particular, matrix elements of A appearing in the expression ()6.H) for B diverge 
when 9 = ±7r/2, and may lead to nonzero boundary terms once integrated. 
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3. The gauges were dependent on the trajectory weight e , and thus non-autonomous. 
This causes feedback behavior between the phase-space variables a and f3, and 
the weight, which is difficuh to analyze to definitively determine whether mov- 
ing singularities are present or not. (Compare to point 6(c) in Section [6. 3. 2j) . 

4. The gauges diverge at — > ±7r/2, against recommendation 6(b) of Section l6.3.21 



7.10 Gauge recommendation 

Collecting together the analysis, and results of numerical calculations reported in 
this chapter, the following gauge choices appear to be advantageous for a single 
mode (labeled n) of a two-body locally interacting Bose gas that is free to interact 
with its environment: 
Defining 

fin = anPn, (7.104) 

then 

6^n = -«^n = -v^Im{rln}e-< (7.105) 

The local diffusion gauge dependent on the single "target time" parameter topt, or 
more precisely on the "remaining time to target" 

t.e.= ('°'^*"''''^'°^* . (7.106) 

[ if t > topt 

is then given by 

g'^ = ^log |8|rin(t)|\trem + a2''^(^^n(^),7n^rem)| , (7.107) 

i.e. ()7.74|) . The coefficient 02 is 



a^in, v) = l + Aim {ny { — — — j - 2\n\' i ) . (7.108) 
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7.11 Summary 

The principal aim of this chapter has been to develop a "black box" form of drift 
and diffusion gauges. A form that can be expected to remove boundary term errors 
and extend simulation times beyond what is possible with the positive P for a single 
mode of a locally-interacting Bose gas with two-particle collisions that is open to 
the environment. The recommended form is given by ()7.104|) - ()7.108p . These gauges: 

1. Extend simulation time for the anharmonic oscillator beyond what was pos- 
sible with the positive P representation (i.e. when g'-f^ = and Qn = Gn)- 
For high mode occupations, the simulation time is extended by a factor of 
O (^15nl^^^ (see Table 17.11) when starting from a coherent initial state, repre- 
sentative of a single trajectory. This simulation time is O (40) coherence times 
at high mode occupation, and all decoherence behavior is simulated. The im- 
provement in numerical performance is best summarized by Figures \7.'A\ I7.(i| 
and in Table mi 

2. Remove the instability responsible for moving singularities. 

3. Apply for dynamically changing mode occupation. 

4. Apply for simulations of open systems. 

For the aim of using these gauges in many-mode simulations of open interacting 
Bose gases, it is crucial that the gauge choice be freely adaptable to any changes 
caused to the mode occupation (or other properties). Such changes will be caused 
by interactions with the rest of the modes, and an external environment. 

A second possibly alternative method where only diffusion gauges are used has 
been considered, and appropriate optimized diffusion gauges derived. This method 
does not guarantee removal of systematic biases, but appears to have advantages in 
efficiency for many-mode calculations provided the simulation can be successfully 
monitored using the indicators of Gilchrist et a/ [HI] to catch any boundary term 
errors if these form. 

Lastly, the method developed has been tied in with and compared to some pre- 
vious related work in the field jH El Ej- 



206 



Chapter 8 Dynamics of two coupled Bose gas modes 



Chapter 8 

Dynamics of two coupled Bose gas 
modes 

The next step up towards a many-mode simulation is to see how the statistical 
behavior of the gauge methods is affected once mode mixing occurs. In this chapter, 
the effect of the combined drift and diffusion gauges suggested in Section 17.101 on 
dynamical simulations of a two-mode system will be considered (undamped, gain- 
less, with two modes coupled by Rabi oscillations). This system is simple enough 
that there are only a few parameters, yet many of the changes caused by mode 
mixing are visible. 

8.1 The model 

The system consists of two orthogonal modes labeled 1 and 2 with interparticle in- 
teractions in each mode, and Rabi coupling between them. The coupling frequency 
will be restricted to be real for simplicity (complex values can always be considered 
in similar vein with no significant alterations to the following discussion). Time 
units are chosen so that the nonlinear interaction frequency is x = aiid a trans- 
formation is made to an interaction picture in which the linear mode self-energies 
hwjjUj are moved into the Heisenberg evolution of operators. The interaction picture 
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Hamiltonian then is 

2 

+ hY,a]^al (8.1) 

The Rabi frequency is Ui2 (in scaled time units). 

Physically this model can represent, for example, a simplified model of two in- 
ternal boson states coupled by an EM field, or two trapped condensates spatially 
separated by a barrier. This approximation has been used by a multitude of authors 
to investigate the quantum behavior of BECs in two-state systems. For references 
to some of the work in this field see Leggett [301 , Part VII. 

Comparing to the lattice Hamiltonian ()2.17|) of Chapter |21 the modes 1 and 2 
can be identified with some modes n and m, respectively. Then, uji2 = uj^m/x^ 
while X, ^nn, and cUmm have been scaled out or moved into the Heisenberg evolution 
of the aj. It has been assumed that Unm is real, and that all 7n and = 0. 

From the gauge P equations ()5.17|) of Chapter and not yet specifying the exact 
form of the noise matrices, one obtains gauged Ito stochastic equations 

daj = -iujua^j dt - 2iapj dt + ^ B^^ {dWk - Qk dt) (8.2a) 

k 

d(3j = iuJi2P^jdt + 2iajf]]dt + J2B\kidWk-gkdt) (8.2b) 

k 

dn = Y^^kdWk, (8.2c) 

k 

where -ij = 1 -|- 6ij, and dWk are the usual Wiener increments. The noise matrices 
obey 

E^.?^]? = -^^S,ka] (8.3a) 

Y^BfE^ = 2^5,,P] (8.3b) 

= 0. (8.3c) 

I 



H = hu 



12 



a\a2 + ^2^1 



Applying the drift and diffusion gauges ()7.105|) and ()7.107|) worked out in Chap- 
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ter[3to each mode separately, these equations become 

daj = — iu^a^j dt — 2iajRe{nj} dt + iajV^d [cosh g J dWji + i sinh g"dWj2\ (8.4a) 
d(3j = iuup^j dt + 2i(3jRe {hj} dt + pjV2i[-ismh g'-dWji + cosh g'-^ (8.4b) 

dn = -V2inJ2^^{f^j}e~^HdWji-idWj2), (8.4c) 
j 

where fij = ajPj, and 

g] = ^log {8|n,| V + (1 + Alm{n,ff} . (8.5) 

The four independent real Wiener increments dWjk can be implemented by inde- 
pendent Gaussian random variables of variance dt at each time step. 

Proceeding to check the no-moving-singularities condition fj6.3|) as for the single 
mode in Section ESI there is no super-exponential growth of any \aj\, or 
Moving singularities or noise divergences will not occur. 

Two kinds of initial conditions were numerically investigated in detail: 

8.2 Case 1: Coupling to a vacuum mode 
8.2.1 Description 

The system starts initially with a coherent state of mean particle number uq in mode 
1, and vacuum in mode 2. In all simulations of this case, the inter-mode coupling 
strength was taken to be uji2 = 5, but the mean particle number no was varied. 
This mean particle number N = no = "^jinj) = ^j(p^jCij) is conserved during each 
simulation. 

At low particle number, the Rabi oscillations dominate the Hamiltonian, and 
particles oscillate between the modes, without much decoherence. At high particle 
number rio, on the other hand, phase decoherence dominates mode 1, suppressing 
also the coherent transfer of particles to mode 2. 

The particular values chosen to simulate were 



no = {1,17,200,1500,10^}. 



(8.6) 
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The two observables considered were: The fraction of particles in the (initially 
empty) mode 2: 

P. ^ f , (8.7) 
and the local normalized second order correlation functions of modes j: 



The latter quantify the amount of (instantaneous) bunching/antibunching in the 
boson field, and for example, are unity for coherent states, two for thermal fields, 
and 1 — 1/n for Fock number states of n particles. Large values g^^^ > 2 occur 
e.g. for quantum superpositions of vacuum and Fock number states with two or 
more particles where the average particle number is small. For example in the state 
\^p) = sin^lO) + cose|n), ^(2) = (1 - i)/cos2^. 

8.2.2 Procedure 

Three gauge choices were compared for this case. 

1. No gauge, i.e. positive P. 

2. Drift and diffusion gauges as in equations ()8.4j) with ()8.5j) .The target time topt 
was varied to obtain the longest useful simulation time tgim- 

3. Drift gauge (j7.99|l of Carusotto et a/p. As pointed out before, a subtle differ- 
ence from Ref. |1| is that here particle number conservation is not explicitly 
hardwired into the simulation, but happens to arise because no environment 
interactions are included in this particular simulation. This freedom is left un- 
constrained to make conclusions reached on the basis of the two modes relevant 
to a larger open many-mode system. 

In each run, S = 2 x 10^ trajectories were used, and useful simulation precision 
was defined as in ()7.88|1 . and simulation time tgim accordingly. 

In Figure 18.11 actual simulation times are compared to physical timescales and 
expected simulation times based on the single- mode analysis of Chapter [7| Note that 
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fitting parameters to ()7.89|) for tlie Carusotto et al [Tj gauge ()7.99|1 were calculated 
from single-mode simulations to be 

{co, . . . , C4} = {1, 3.3 ± 0.5, 0.9 ^^2> -0-3 ± 0.4, 0.31 ± 0.09}, (8.9) 

and for ()7.105p with ()8.5|) and topt = were 

{co, . . . , C4} = {1, 400 ± 130, 3.6 1^3, 1.2 ± 0.2, 0.42 ± 0.03}. (8.10) 

Example simulations are shown in Figures IH^ 18.31 f8.4[ and l8.51 and dependence 
of tsim on the target time parameter topt are shown in Figure 18.61 

Any data from positive P simulations after the onset of spiking ( numerical 
signature of possible boundary term errors) were discarded. The simulations up to 
such a point, showed no sign of any systematic biases^. 

8.2.3 Features seen 

• With the single-mode gauge choices ()7.105p and (j8.5|) applied directly to the 
twin-mode system, improvement over the positive P and Carusotto et a/ gauge 
()7.99j) IJ is seen for all uq 

• This improvement becomes more and more significant at high mode occupa- 
tions, like in the single-mode system. Generally speaking, the proportions 
between the simulation times obtained with the different gauge methods are 
qualitatively similar to the single-mode situation. 

• At large mode occupations, one can still simulate well beyond coherence times, 
so most interesting phenomena should not be missed (apart from quantum 
revivals, of course). (See Figures IOM8.5|l 

• However, simulation times with all methods are significantly smaller than those 
predicted (by using expected mode occupation no) for single modes from Ta- 
bles O or O or (ra- dHrrnii . 

^The onset of spiking naturally moves to slightly earlier times as larger ensembles of trajectories 
are simulated. This may cause a beneficial "bias-concealing" effect in the following sense: As pre- 
cision increases due to larger S, some small boundary term biases that were originally concealed by 
noise might emerge, but simultaneously some rare trajectories with earlier spikes become included 
in the larger ensemble. These then warn one to discard data from an earlier time than one would 
have with the original smaller ensemble. 
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Figure 8.1: Useful simulation times t^\^ for a mode coupled to vacuum as in Section 
Calculated simulation times are shown as data points, with the symbols denoting gauge used. 

positive P; "Q": drift gauge (|7.1()5|l and diffusion gauge with iopt = 0; "+" drift gauge 
H7.99|l of Carusotto et al \ \ "v" • drift and diffusion gauges H7.105|l and H8.5|l with best target 
time parameters: uj\ito^t = {2.5,0.5,0.5,0.075} for no = {1,17,200,10''}, respectively. Subplots 
(a) and (b) show simulation times based on estimates of the observable p2 , while (c) and (d) times 
based on t/j • Subplots (a) and (c) compare to physical time scales, including Rabi oscillation 
period iRabi = 27r/aji2, while subplots (b) and (d) compare to expected simulation times for a 
single mode using the empirical fits of Table [7?^ and (|8.9|) - H8.10|) . The expected tsim are plotted as 
light fines: dotted: positive P; SOLID: drift gauge H7.1()5|l and diffusion gauge (|8.5|l with topt = 0; 
DASHED: with optimum topt choice; dash-dotted: drift gauge (|7.99() of Carusotto et al^. 



212 



Chapter 8 Dynamics of two coupled Bose gas modes 



1 

0.8 
0.6 
0.4 
0.2 







60 r 








/ 1 




^ 40 


/ \\ 






■ / ^ 






/ 




^6^20- 














y . 















0.25 



0.5 



0.22 0.24 0.26 0.28 




0.25 



0.24 0.26 0.28 

V^Rabi 



Figure 8.2: Coupling to vacuum mode: System with few particles on average (ng = 1). 

Time scaled to Rabi period tRabi — 2h/loi2. Subfigures (a) and (c) show mode 2 particle fraction, 
while (b) and (d) show instantaneous particle number correlations in mode 2. Subfigures (a) and 
(b) show results with positive P light solid and drift gauge H7.99|l dark solid, whereas (c) and 
(d) show results with combined drift gauge H7.105|l and diffusion gauge H7.1()7|l using target time 
iopt — 2.5/u;i2 ~ 0.4tRabi- Exact results also shown (dashed). Triple lines indicate mean and error 
bars. 



• No biases from the exact solution are seen for the cases (no = 1,17) where this 
was compared. 

• The no = 17 case was also simulated by Carusotto et a/T] with (solely) the 
drift gauge ()7.99j) . and a simulation time of ~ 0.55/ci;i2 was seen. This is 
about 20% longer than with the same gauge here. This effect is most likely 
because there total particle number conservation was a priori imposed on each 
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Figure 8.3: Coupling to vacuum mode: System with intermediate number of particles {uq = 
17). Subfigures (a) and (c) show mode 2 particle fraction, while (b) and (d) show instantaneous 
particle number correlations in mode 2. Subfigures (a) and (b) show results with positive P light 
SOLID and drift gauge (|7.99|l dark solid, whereas (c) and (d) show results with combined drift 
gauge H7.105() and diffusion gauge H7. 107(1 using target time cc'i2topt — 0.5. Exact results also shown 
(dashed). Triple lines indicate mean and error bars. 

trajectory rather than only in the ensemble mean as here. The extra freedom 
in the present simulations is necessary to allow straightforward extension to 
open systems, but the price paid appears to be some extra inefficiency. 

• Precision in the highly-occupied mode appears largely insensitive to the choice 
of target time topt, as long as it is within reasonable values (See Figure IH^ . 
A choice of topt = achieves very large improvement over positive P and drift- 
gauge only simulations for these observables. Note that topt = is not the 
same as no diffusion gauge, because ()8.5|) becomes g'j = | log(l -|- 4Im {rij}'^), 
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Figure 8.4: Coupling to vacuum mode: System with large number of particles {uq — 200). 
Subfigures (a) and (c) show mode 2 particle fraction, while (b) and (d) show instantaneous particle 
number correlations in mode 2. Subfigures (a) and (b) show results with positive P light solid 
and drift gauge H7.99|l dark solid, whereas (c) and (d) show results with combined drift gauge 
1)7.105(1 and diffusion gauge ((7.1Q7|I using target time uiutopt = 0.25. Triple lines indicate mean 
and error bars. 



and this appears to have an important effect. The same apphes to mode 2 
when no is not large. 

• At high hq the observables for the nearly-vacuum mode 2 become sensitive to 
target time choice (See Figure IH^ . This is particularly evident for g^^ when 
uq = 10^. The optimum target time topt is then sharply peaked, much as in 
the single-mode case, and tgim ~> ^opt in the optimum region where topt is of 
the order of coherence time for the highly-occupied mode 1. 
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UJut UJut iOut 

Figure 8.5: Coupling to vacuum mode: System with very large number of particles 

(no — 10"'). Subfigures (a), (b), (d) and (e) show mode 2 particle fraction, while (c) and (f) 
show instantaneous particle number correlations in mode 2. Subfigures (a)-(c) show results with 
positive P LIGHT SOLID and drift gauge H7.99|l dark solid, whereas (d)- (f) show results with 
combined drift gauge H7.105|) and diffusion gauge (|7.1U7|I using target time uJi2topt = 0.1. Triple 
lines indicate mean and error bars. 



• One point to note is that "useful simulation time" tgim was based on the 
moment when relative error in a quantity was first found to be too large. In 
calculations of g^'^\ uncertainties are much greater when g^'^\0, t) peaks — see 
e.g. Figure IS31 (f). Good accuracy can often be obtained between peaks for 
much longer times than shown in Figure IHIEl up to about the same simulation 
time as worked out based on the occupation number. This is particularly 
evident in the uq = 10^ case, where the sudden drop-off in tsim is due to this 
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Figure 8.6: Dependence of useful simulation time tsim on the target time parameter topt for 

simulations of Section lO "+" : useful simulation of (ni), "x": of (na), of 5^ \ "Q": of 

(2) 

(72 ■ Nonlinear but monotonic fopt scale used to avoid overlap of data points, while showing trend. 
Subplot (a): no = 1; (b): no = 17; (c): no = 200; (d): no = 10''. iopt = isim shown for reference 

as LIGHT DASHED hnC. 

effect (see solid line of Figure IHl)l( d)). 

• Best simulation times at high no occur for topt <~ 4im- 

• Positive P simulations do better than those based on the drift gauge ()7.99|1 for 
all uq. 

• Best simulation times (based on (n2)) with both gauges appear to scale roughly 
as 



iOutsim ~ 3/% 



1/3 



(8.11) 
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Collective conclusions are left till the end of the chapter. 

8.2.4 Comparison to Fock state stochastic wavefunctions 

The simulation of a seemingly similar 17-particle two-mode system was carried out 
by Carusotto et al[l\ using the "simple Fock state scheme" with stochastic wave- 
functions. This method gave a useful simulation time of ~ 3/^^12 with S = 2 x 10^ 
trajectories. This is longer than seen with either the "simple coherent state scheme" 
therein 0.55/^^12), or the simulation in Figure IH^ here (~ 1.18/^^12). This was, 
however, for number-state initial conditions, so the two types of simulation cannot 
be easily directly compared, and the observable expectation values calculated are 
different . 

Coherent states with mean occupation no, used as starting points here, con- 
tain Fock components with much larger numbers of particles than no. Since the 
spread in the norm, and hence the uncertainty of observables, has been found to 
rise exponentially depending on the quantity Nt = not (from equation (78) in P), 
one expects that useful simulation time of the simple Fock state scheme will be 

1 /3 

~oc 1/ N = l/nQ. This is a sharp decrease compared to the l/ng dependence with 
the combined gauges discussed here, and one expects that for no ^ 17, the Fock 
state wavefunction method will give much shorter simulation times. 

One can attempt to compare schemes for no = 17 in spite of the differences in 
the physical system simulated. It is not clear how best to sample a coherent state 
(which includes Fock state components with different particle numbers) with the 
Fock scheme, but one can estimate that for accuracy one should consider components 
with occupation at least three standard deviations away from the mean giving < 
30 for such Fock state components. A trajectory corresponding to such a component 
might be expected to not diverge for times uut < 1.7. 

This is about 30% longer than seen in Figure a factor similar to that noticed 
between simulations using the gauge ()7.99j) in Ref. jT] using the "simple coherent 
scheme" and here. It is likely the price paid for having the formalism left open to 
simulating gains and losses. 
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8.3 Case 2: Coherent mixing of two identical modes 

8.3.1 Description 

The system starts initially with identical coherent states of mean particle number 
no in each mode. The total mean particle number is = 2 no. During time 
evolution, Rabi coupling of the modes occurs along with some decoherence from 
inter-atom collisions in each mode. This time simulations were carried out with 
constant particle number uq = 100, but the coupling frequency uu was varied. 

At low frequency UJ12 -C no = 100, decoherence local to each mode dominates, 
and phase oscillations in each mode occur with period tosc = '^/'^o, while at high 
frequency uji2 uq = 100, the inter- mode coupling dominates and phase oscillations 
for each mode occur with period 27r/co'i2. One expects that for low coupling the two 
modes should behave largely as two independent single modes of Chapter [7| 

The particular values chosen to simulate were 

= {5000, 500, 50, 5, 0.5, 0.05, 0.005, 0.0005}. (8.12) 

The observable of most interest here was G^^^ (0, t) as in the single-mode case. Due to 
the symmetry of the system, all single-mode observables such as this have identical 
expectation values for both modes, so either mode can be considered. 

The procedure was the same here as outlined in Section 18.2.21 with S = 10^ 
trajectories per simulation. 

In Figure IHIZI actual simulation times are compared to physical timescales and ex- 
pected simulation times based on the single- mode analysis of Chapter [7| Simulation 
times tsim were based on the condition ()7.88|1 for the observable \G^^\0,t)\. 

Figure IHUl shows data from a quick search for optimum target times topt? while 
some example simulations are shown in Figures 18. 9| I8.10[ and 18.111 

8.3.2 Features seen 

• At weak coupling uji2 ^ uq, the simulation times with various gauge choices 
behave qualitatively similar to the single mode of Chapter [7[ and vacuum 
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U)i2 (scaled time units)"^ uji2 (scaled time units)"^ 

Figure 8.7: Useful simulation times tsim for a two identical modes undergoing coherent mixing 
as in Section f8.3l Calculated simulation times are shown as data points, with the symbols denoting 
gauge used. "□" : positive P; "Q" : drift gauge (|7.1U5|) and diffusion gauge H8.5|) with iopt — 0; "+" 
drift gauge H7.99(l of Carusotto e< a/P; "x": drift gauge (|7. 105(1 only. Subplot (a) compares to 
physical time scales, including Rabi oscillation period tRabi — 27r/cji2, while subplot (b) compares 
to expected simulation times for a single mode using the empirical fits of Table 17?^ and l|8.9|l - l|8.10|l . 
The expected tgim are plotted as light lines: dotted: positive P; SOLID: drift gauge ((7. 105(1 and 
diffusion gauge ((8.5(1 with iopt = 0; dashed: with drift gauge ((7.105(1 only; dash-dotted: drift 
gauge ((7.99(1 of Carusotto et al 1 . 



coupled modes of Section 18121 That is, the dual gauge combination ()7.105|) and 
()8.5|1 allows simulation until the system has decohered, and give much longer 
simulation times than the positive P, drift gauge ()7.105|) only, or Carusotto et 
a/gauge ()7.99|) (in that order). The drift-gauge-only methods simulate for less 
time than a single phase oscillation period. 

• Simulation times are shorter for all methods than those given in Table mi for 
the relevant single-mode case. 

• At strong coupling, uji2 ^ uq, the plain positive P method appears to give 
the best results out of the gauges tried. The times achievable with the gauged 
methods are much shorter. Relatively longest with the Carusotto et a/gauge 
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Figure 8.8: Mixing of two identical modes: Dependence of useful simulation time tsim on the 
target time parameter iopt for simulations of Section 18.31 Subplots (a) to (d) arc for coupling 
strengths uji2 = {0.0005, 0.05, 5, 500}, respectively. Data are from single simulations with S = 10^. 



()7.99p . then the drift gauge ()7.105|) . and worst performance is obtained from 
the diffusion-gauged simulation. 

• At strong couphng, none of the methods tried reach coherence time tcoh, but 
many Rabi oscillations can be simulated by all methods, since these are now 
much shorter than tosc- 

• Choosing various target times topt appears not to have no effect apart from 
a worsening of the simulation at large topt values. Note that this was also 
the situation for the vacuum-coupled simulations in Section IH^ with the same 
total particle number = 200, and so this lack of improvement with topt 
may be typical of lower mode occupations when any significant coupling uji2 
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Figure 8.9: Mixing of two identical modes: Weak coupling: (wi2 = 0.0005). Subplots 
(a)-(c) show, respectively, simulations with positive P (light line); with Carusotto et al^ 
drift gauge (|7.99|) (Dashed line); and with both drift gauge H7.105|l and diffusion gauge l|8.5l) 
(Solid line). The exact solution (|7.11b(l for no coupling (ti>i2 = 0) is shown as a dot- 
ted line. Subplot (d) shows relative sampling error in the estimate of |G'-^-'(0, i)|, such that 
Aprcc 10^/SjWA\G^'^^{0,t)\/\G^^'>{0,t) is less then unity while "useful" precision is present, 
when defined as in (|7.88|) . 



is present. 

• At strong coupling, the Carusotto et gauge ()7.99|) allowed somewhat longer 
simulation times, while the signal to noise ratio in the estimate of G^^\0,t) 
was lower with the gauge ()7.105|1 at short and intermediate times — see Fig- 
ure EIIl(d). 
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Figure 8.10: Mixing of two identical modes: Intermediate coupling: {uji2 = 5). Sub- 
plots (a)~(c) show, respectively, simulations with positive P (Light line); with Carusotto et 
a/P drift gauge l|7.99(l (Dashed line); and with both drift gauge (j7.1()5|l and diffusion gauge 
(|8.5|l (Solid line). The exact solution Ij7.11b|l for no couphng {lji2 — 0) is shown as a dot- 
ted LINE. Subplot (d) shows relative sampling error in the estimate of |G''^^(0, i)|, such that 
Aprcc = 10^/sJWA\G^'^^{0,t)\/\G^^'>{0,t) is less then unity while "useful" precision is present, 
when defined as in (|7.88(l . 



8.4 Analysis and Conclusions 

The two kinds of initial conditions considered above represent the two kinds of 
situations generically occurring in all many-mode simulations. Coupling between 
modes of widely differing occupation will lead to behavior similar to that seen in 
Section [8.21 (Case 1). Coupling between modes of similar occupation will display 
features similar to those seen in Section lH31 (Case 2). Adjacent spatial or momentum 
modes typically behave like Case 2, since if a field model is well resolved by the 
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Figure 8.11: Mixing of two identical modes: Strong coupling: {u!i2 = 500). Subplots (a)-(c) 
show, respectively, simulations with positive P (Light line); with Carusotto et al 1 drift gauge 
H7.99|l (Dashed line); and with both drift gauge (|7.1Q5|I and diffusion gauge H8.5(l (Solid line). 
The exact solution for Ij7.11b|l no coupling (tJi2 = 0) is shown as a dotted line, and for no 
collisions {loi2 ~^ oo) as a dot-dashed line. Subplot (d) shows relative sampling error in the 



estimate of |G'(i)(0,t)|, such that Ap^cc = lO^/sJWA\G'^^\0,t)\/\G(^^\0,t) is less then unity while 
"useful" precision is present, when defined as in H7.88|l . 



lattice, then physical properties (e.g. density, and hence mode occupation) should 
not change much over the distance between neighboring lattice points. 

Still, this chapter has not really been a comprehensive assessment of gauge per- 
formance for all coupled mode cases, since not all parameter regimes have been 
explored. For the most general two modes starting in (off-diagonal) coherent state 
initial conditions one could be looking at the 5 complex parameters nj, aj, uju — no 
mean feat to investigate. Rather, this chapter has been a study of several commonly 
occurring cases to get an indication of whether the simulation improvements found 
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for a single mode survive when inter-mode coupling is present, and to what degree. 
Several aspects of the situation have emerged: 

1. When scattering within a mode dominates over the coupling between modes, 
the response of the system to the gauges is similar to what was seen for a 
single mode. That is, 

(a) When mode occupation is high ^ 1, use of the combined drift ()7.1()5j) 
and diffusion ()8.5j] gauges developed in Chapter [3 gives dramatic im- 
provement in useful simulation times and observable precision over the 
positive P method. In this regime, the drift-gauged only (e.g. ()7.105|) or 
()7.99|l ) simulations reduce simulation time from even the positive P case, 
although boundary terms are removed. Using both gauges ()7.105p and 
()8.5p allows simulation to times by which full decoherence has occurred. 

(b) When mode occupation is low tiq < O (1), the improvement gained with 
the gauges is smaller although some improvement is always seen. The 
absolute simulation time is longer than for no ^ 1, however. 

Still, it turns out that in this strong scattering regime, one doesn't need much 
inter-mode coupling UJ12 to reduce the simulation time in absolute terms by 
a factor O {2 — 10), although this reduction mainly affects the time after full 
decoherence has occurred, and not much is physically happening. 

2. The beneficial effect of choosing topt > appears to be strongly suppressed 
when the coupling uji2 is weak but not yet very weak, or the mode occupation 
is large but not yet very large. The topt = diffusion gauge 

g-itopt = 0) = ^log(l + 4Im{n,}'), (8.13) 

does however have a marked beneficial effect whenever two body scattering 
is significant, and simulations using this gauge show very significant improve- 
ment over positive P or g" = simulations for a wide range of parameters. 
This parameter range appears to be uji2 <^ nx, and O (1) ^ nQ < O{10^). 
Simulation times obtained are smaller by about a factor of O (2) than those 
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given for a single mode with the same gauge. At higher occupations, the ben- 
efit gained with ()8.1H|1 abates but nonzero topt values appear to become useful 
again, and continue to provide strong improvements over positive P simula- 
tions. (See, e.g. Figure IHUKd).) The gauge ()8.13|) is convenient also because 
there is no a priori parameter topt- 

3. When the inter-mode coupling dominates, the gauges developed in Chapter [7| 
or by Carusotto et al jT] do not appear to be very useful. They actually reduce 
simulation time as compared to the plain positive P method, although several 
Rabi oscillation periods can always be simulated. 

The transition between the strong and weak coupling behavior appears to be 
at around 

uJi2 ~ Xn, (8.14) 

which is the point at which the expectation values of the coupling and two- 
body scattering energies in the Hamiltonian are approximately equal. 

At the level of the stochastic equations ()8.2j) . the evolution of (say) daj can 
gather noise from three sources 

1. Directly from the local noise term oc aj^fxdWj 

2. Indirectly from the local nonlinear term oc x^j^j, which can amplify variation 
in the local noise term. 

3. From the other mode through the coupling term oc lOyiO^^y 

The drift gauges ()7.105|) developed here or ()7.99p developed by Carusotto et a/p 
neutralize source 2. The diffusion gauges ()8.5j] or ()7.97djl suppresses the direct noise 
source 1. No local gauge can suppress the third source of randomness, however, 
because the randomness in is largely independent of any processes occurring in 
mode j. What happens is that even small randomness in one mode feeds into the 
other, can become amplified, and fed back again. 

The part of the diffusion gauge that is retained at topt = appears more robust 
to this mixing than the additional optimization obtained with topt > 0. 
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Based on this, and the behavior tsim vs. topt shown e.g. in Figures EH and ESI 
a way to proceed with a full many-mode simulation would be to 

1. Run a simulation with local drift gauge ()7.1()5|1 . and topt = diffusion gauge 
()8.13j) . Call the simulation time obtained t^^l^. 

2. Run a simulation with topt = ^sim' obtain a simulation time t^^l^. 

3. If tgj^^ > tg°|j then nonzero topt is be beneficial, one expects tsim ^ ^opt, and one 
can make a search for a reasonable value by now running a simulation with 
^opt = ^liili; and iterating in this manner until no more significant improvement 
is gained. 

If, on the other hand, t^?^ < tg°|j, then it appears that one is in a regime 
where the topt parameter does not affect the dominant noise source, and tg°|^ 
is the best simulation time one can obtain with the ()7.1()5|l fc ()8.5|) dual gauge 
method. 

4. If simulation time is too short, it may be worth trying a plain positive P simu- 
lation, or even a diffusion-gauge-only simulation using ()7.97d|) . while carefully 
monitoring for the presence of spiking, and discarding any data beyond the 
first spike time. (The simulations carried out in this chapter with this ap- 
proach did not show any bias when compared with exact results — see e.g. 
Figures O and ESI). 

In conclusion, it has been found in this chapter that simulation time and precision 
improvement can still be gained with coupled modes by using the local drift and 
diffusion gauges ()7.1()5|) and ()8.5j) . although marked improvement will mostly be 
seen only when two-particle scattering dominates the inter-mode coupling. Other 
non-local gauges may improve simulations in a regime where inter-mode coupling 
dominates but development of such is for future research. 
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Chapter 9 

Single-mode interacting Bose gas 
thermodynamics 

In Section 15.61 it was shown that the gauge P representation can also be used to 
calculate properties of the grand canonical ensemble of an interacting Bose gas 
described by the lattice Hamiltonian ()2.12|) . A convenient feature of calculations 
made with this method (rather than, say path integral Monte Carlo, or variational 
Monte Carlo methods) is that a single simulation can estimate any observables, and 
gives results for a wide range of temperatures T > l/A;Bmax[r]. 

Before attempting full many-mode simulations (see Chapter ITT|l . it is pertinent 
to make sure that the simplest special case of a single-mode is correctly simulated. 
This is a good test case because it can easily be solved exactly for a unambiguous 
comparison, and because it is simple enough that a broad analysis of the statistical 
behavior of the simulation is possible. 

The single-mode model is particularly relevant to a locally-interacting many- 
mode Hamiltonian ()2.17|) . Here the stochastic equations to simulate are given by 
()5.50p . and it can be seen that the nonlinearity and noise in the evolution of local 
amplitudes an for each mode n depend only on the local variables an and Pn- In 
such a situation the nonlinear and stochastic features of the many-mode behavior 
should appear in their entirety in the single-mode toy model. 
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9.1 The single- mode model 
9.1.1 Quantum description 

Isolating a single mode from the description of Section the master equation for 
the evolution of the (un-normalized) density matrix is 

dpu 



dr 



Pe{r)n-H pu (9.1) 



with n = a^a (the number operator) defined in the usual way in terms of boson 
creation and annihilation operators. The "imaginary time" parameter is r = l/ksT, 
and the "effective chemical potential" /ig is defined in fl2.30|) . That expression can 
be integrated to give 



Mr) = I 



-Xn + / Pe{t)dt 

Jo 



(9.2) 



with constant A^, and it is then seen that at low temperatures p{T) Pe{T)- The 
single-mode analogue of the interacting Bose gas Hamiltonian is 

H = hxn (n - 1) , (9.3) 

where any linear contribution oc n has been amalgamated into the chemical poten- 
tial. The initial state is (as per ()2.31|1 ) 

p„(0) = exp [-Kn] . (9.4) 

Physically, this single-mode model is an approximation of e.g. a boson orbital in a 
heat bath at temperature T, and chemical potential fi, where this chemical potential 
includes any kinetic and external potential effects. 

9.1.2 Gauge P stochastic equations 

Defining 

n = ap = n' + in" (9.5) 
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as in previous chapters, the Ito stochastic equations in a gauge P representation are 
obtained from ()5.5()|) : 



da = {^,e — 2hxn) adr + ia^/2hx{dW — Q dr) (9.6a) 
df3 = Q (9.6b) 
dVL = n[{fXe-n)ndT + gdW]. (9.6c) 

The Wiener increments dW can be implemented as independent real Gaussian noises 
of mean zero, and variance dr at each time step of the simulation. The single complex 
drift gauge is Q. 

Because only a single non-weight variable a experiences diffusion, then the un- 
gauged diffusion matrix is Z2 = — 2/ixa^, a single complex function. There are then 
no standard diffusion gauges (as described in Section [4. 4. 2|1 because the orthogonal 
1x1 matrix ()4.68|1 is just = 1, apart from a phase factor. 

Initial conditions on the variables a and /3 are (from ()5.59|) ) 

Poia, (3, Q) = S\Q - l)S\f3 - a*)^ exp , (9.7) 

where 

no = (9.8) 

is the mean particle number at high temperature, a is easily sampled using Gaussian 
random variables. 

The mean number of particles is (n) = Tr [pun] /Tr [p„], and is estimated by 

t',n?>'°" (9.9) 

These stochastic equations exhibit completely different behavior to the dynamics 
of this system ()7.3|1 . Rather they are more similar to the two-boson absorption of 
Section I 



9.1.3 Exact solution 

The quantum evolution of this model is easily evaluated exactly in a Fock number 
state basis \n). This is useful to make a definitive check of the correctness of the 
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stochastic simulations. Using = exp firn — Hr 
this basis pnH = {n\ pu |^) are found to be: 



the density matrix elements in 



Pn 



5„ji exp {[/i - hx{n - 1)] nr} . 



(9.10) 



A long times r — oo, i.e. low temperatures, the density matrix will be dominated 
by the populations for which the exponent is largest. This maximum of the exponent 
occurs when = [limT-_^oo yu(T) + h'x\/2hx- Since n takes on only integer values, 
then the dominant mode occupation will be at the nearest integer to n^o (or the two 
nearest components if rioo takes on half-integer values). Using ()9.2|) . 



1 1 
noo = - + — — hm 



2 ■ 2hx 

and the T ^ state p(r oo) is: 



A vacuum if < 0.5. 



Pe(t)dt 



(9.11) 



Otherwise, if half integer, then p(r oo) is an equal mixture of the 

\noo ± 5) states. 

Otherwise, p(r 00) is the Fock number state with the nearest whole occu- 
pation number to noo. 



9.2 Moving singularities and removal with gauges 
9.2.1 Moving singularity 

Consider, for now, the case of constant effective chemical potential pe- From ()9.fi|l . 
and using ()B.7|) . the Stratonovich stochastic equation for the complex occupation 
variable n is 



dn = n 



PedT - hxi2n- 1) dr + t^/2h^{dW - g dr)] . (9.12) 



In a positive-P-like simulation^, Q = 0, and the deterministic part of the dn equation 

has the phase-space structure shown in Figure a). There are stationary points 
^But with a weighting gauge to allow for complex weight evolution dfl. 
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at vacuum and at 

n = n^ = l + ^, (9.13) 

with the more positive stationary point being an attractor, and the more negative 
a repellor. The deterministic evolution is easily solved, and with initial condition 
n{0) = riQ gives 

air) = f9 14) 

no + (rioo -rio)e-2^>^"-^' ^ ' ^ 

A moving singularity appears along the negative real axis, and if one starts with a 
negative real no = — jn'ol, then n — > oo at time 

rsing = :T^^logfl-— y (9.15) 

Since initial conditions ()9.7jl lead to samples with no G (0, oo), then some low- no 
trajectories can be expected to rapidly diffuse into the negative real part of phase 
space where the super-exponential growth occurs. This will tend to cause either 
misleading systematic boundary term errors or uncontrolled spiking, which renders 
the simulation useless after a short time. As is seen in Figure in this case it is 
spiking. 

9.2.2 Minimal drift gauge 

To correct the problem one has to change the phase-space topology in some way 
to prevent the occurrence of such moving singularities. The radial evolution (i|ri| = 
—2hx\n\n' dT + . . . is at fault, and the offending term can be removed with the gauge 



g = i^/2hx{Re{n} -\n\). (9.16) 

This gauge has the effect of replacing the — 2/ix|n|Re {n} dr term in d\n\ with 
—2hx\n\'^dT, which always attracts trajectories into the phase space in the vicinity 
of the origin. Polar phase evolution is unchanged, and the resulting deterministic 
phase space is shown in Figure IHTTlf c). If n,oo > the attractor and repellor remain, 
while for rioo < 0, there is just an attractor at vacuum. For near-classical trajectories 
having real n, phase-space evolution is unchanged, and the gauge is zero. 
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Figure 9.1: Deterministic phase space for Stratonovich form of the dn equation. The case of 
fioo = 0.5 is shown, (a): positive-P-like simulation Q — Q\ (b): radial gauge H9.17|l : (c): minimal 
gauge (|9.16() . The moving singularity in (a) is shown with large arrow, the attractors at \n\ ~ rioc 
with a thick dashed line. 



How do the modified equations measure up to the gauge choice criteria of Sec- 
tion IHISUP In order: 

1. The moving singularity in dn has been removed (at high \n\ the deterministic 
behavior is restorative towards the origin), and no other moving singularities 
are present. Note that if h remains finite, then so does Q, which just ex- 
periences exponential growth with an exponent that is always finite, since it 
depends only on n. 
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2. No new moving singularities have been introduced. 

3. No noise divergences (new or old) are present since all noise terms satisfy ()6.3|) . 

4. No discontinuities in the drift or diffusion coefficients are present provided the 
time-dependence of /Ue(T) is not singular. 

5. Is the weight spread minimized? 

(a) Gauge is zero when h is positive real, and small in its neighborhood, 
where gauge corrections are unnecessary. 

(b) Explicit variational minimization of Q has not been carried out, but rather 
criteria 6 have been applied. 

(c) Gauge is zero at deterministic attractor n = rioo- 

6. No unwanted gauge behaviors are present: 

(a) Gauge is nonzero in a large part of phase space, so most trajectories 
contribute to removal of bias. 

(b) Gauge changes smoothly in phase space. 

(c) Gauge is autonomous. 

(d) Gauge breaks the analyticity of the equations. 
9.2.3 Radial drift gauge 

The gauge ()9.16p looks good, but it was found that a more severe phase-space 
modification usually gives smaller statistical uncertainties. This better gauge form 
is 

g = ^^/2h^{h-\h\), (9.17) 

which will be called here the "radial" gauge, due to elimination of any deterministic 
polar-angle evolution as shown in Figure I^Df b). The deterministic attractor is the 
entire |^| = rioo circle (or the vacuum if n^o < 0), with some phase diffusion in the 
phase of h. 
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hxT hxT 



Figure 9.2: Uncertainty in n, the estimate of (n) for the /ig = system, (a): uq — 0.1; (b): 
no = 1; (c): no = 10; (d): no — 100; Solid line: radial gauge (|9.17|) : Dashed line: gauge 
H9.16|l . Simulations were carried out with S = 2 x trajectories. 

The efficiency of the two gauges is compared in Figure 19.21 for /ig = and a 
variety of initial no. It is not immediately clear why the radial gauge is better, but 
the improvement is significant whenever uq > O (1). Similar behavior was seen for 
other fif, values. In particular also for /Xe = hx, which corresponds to a different low 
temperature state (i.e. a pure Fock number state — see Figure l973|l . 

Comparison to the heuristic gauge choice criteria of Section 16 . 3 . 21 follows through 
identically as for the previous gauge ()9.16|) . apart from the gauge not being zero on 
the entire |n| = Uoo attractor but only at n = Uoo- 

9.3 Numerical simulations 

The results of some example simulations using the radial gauge ()9.17|1 are shown 
in Figures 19.31 and 19.41 These are compared to exact results obtained from ()9.10|) 
and also to mean field semiclassical calculations. As outlined in Section [5.41 mean 
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Figure 9.3: Temperature dependent observables given constant and no = 10 at high 
temperature, solid lines: simulation with S = 2 x 10^ trajectories. Triple lines indicate error 
bars (often not resolved at this scale), dashed lines: exact values (mostly obscured by simulation 
results) DOTTED LINES: mean field calculation (subplot (b)). The second order correlation function 
5(2) (o,r) is given by 




Figure 9.4: Mean mode occupations at low temperature T = Hx/lOks as a function of 
constant /ig. dashed: exact result — note the quantization, dotted: mean field result, SQUARES: 
simulation results with radial gauge H9.17f) . Uncertainty was of symbol size or smaller. 
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Figure 9.5: Performance of the mean field and un-gauged simulations for comparison with Fig- 
ure I^SJa). Mean mode occupation with /ig = 0.5hx and no = 10. dashed: exact result, 
DOTTED: mean field result, SOLID: un-gauged simulation. 

field calculations such as solution of Gross-Pitaevskii equations are equivalent to 
simulation of only the deterministic part of the Ito gauge P equations. 

Additionally, a positive-P-like simulation with no gauge {Q = 0) is shown in 
Figure 1^751 for comparison with the gauged technique. 

A subtlety to keep in mind in these thermodynamics simulations is that (in con- 
trast to dynamics) the normalization (Re{fi}), which appears in the denominator 
of observable estimates ()3.14j) . is not unity in the S ^ oo limit. This then requires 
a sufficient number of trajectories per subensemble so that the denominator is al- 
ways positive and not too close to zero for any of these, as discussed in detail in 
Appendix 

Features seen in these figures include: 

• Convergence of the gauged simulation is excellent, and precisely reproduces 
the exact quantum behavior. 

• At low temperatures r > 1/hx the semiclassical approximation gives com- 
pletely wrong results for g^"^^ and is out by O (1) in (n), which is also completely 
wrong if mode occupation is < O {1). 

• The gauge P method, on the other hand, reproduces the exact quantum be- 
havior precisely despite using a semiclassical coherent-state basis. 
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• The un-gauged simulation breaks down while still at high temperatures due 
to boundary term induced spiking. 

• Low temperature observable estimates are consistent with the Fock state low 
temperature ground state described in Section 1^.1.31 ((n) approaches integer 
values^ and the two-particle correlation g^"^^ approaches 1 — 1/ {n)-) 

• Mid-temperature behavior is also simulated precisely. 

• At low temperature (long times) uncertainty in the observable estimates is 
greatest for fi^ even integer multiples of hx (and hence even integer multi- 
ple chemical potential fi at low temperatures), and lowest for fi^. odd integer 
multiples of hx- These cases correspond to the low temperature state be- 
ing a mixture of two Fock number states, or just a single pure Fock state, 
respectively. 

9.4 Chemical potential as a free gauge parameter 

Suppose one is interested in obtaining properties at a given temperature T and 
chemical potential /x. This determines the final simulation target time tt and target 
chemical potential ^i{tt), but the chemical potential at intermediate times t < tt 
is not specified. The only conditions are 1) that the quantity ^{t)t be (piecewise) 
differentiable so that /Ue(T) can be calculated, and 2) that A„ = — lim,-_»o(/UT) be 
positive so that the initial distribution ()9.7p is a normalizable probability. Apart 
from these, ^{t) is formally an arbitrary function (and A„ an arbitrary positive 
number). 

This is all in the limit of many trajectories, however. The form of ^{t) at 

intermediate r has no effect on observable estimate means in the limit 5 ^ oo, but 

can have a strong effect on the broadness of the trajectory distribution. And, hence 

— on the precision of a finite sample estimate. In this sense then, /ie(T) (given by 

()2.3()j) ). acts as effectively an additional gauge function. 

^Apart from when /^e is an even integer multiple of hx, since then (n) approaches half-integer 
values. 
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An indication of the characteristics of an efficient form of /ie(T) can be gained by 
comparing calculated observable uncertainties for several basic cases. Let us look 
at the situation when tt = and /i(rr) = hx- (This gives mean occupation 

n{TT) = 0.5628). 

• Case 1: Variation in no(A„), i.e. in the high-temperature starting mode occu- 
pation, while /ie(T) is a constant chosen appropriately using ()9.2|1 . Calculated 
uncertainties in (n) and g^'^\0,T) are compared in Figure WJ^ It is seen that 
initial occupation O {l) — (10) gives the best performance, while too large or 
too small initial occupation lead to excessive uncertainty in the observables. 

• Case 2a: Time-varying /ie, while tTq = 1 is held constant, /ig is chosen to be 
a nonzero constant up to a time Tc, and zero for times r > Tc < tt- Note that 
as Tc — i> 0, the size of fig grows to make up for the shorter time over which 
it acts. Uncertainties are compared in Figure IHHK a). It is seen that if /ig is 
varied too strongly, an inefficient simulation results, although some relatively 
small variation may be beneficial for the g^'^^ calculation. 

• Case 2b: Again time-varying /ig, while no = 1 is held constant, but this time 
He is held zero at high temperatures (low times r < Tc), and makes up for this 
by becoming a nonzero constant for < r < r-p. As tt, the size of the 
required /ig grows. Uncertainties are compared in Figure I^TTIf b). Again, it is 
seen that strong variation in /Xg leads to an inefficient simulation. 

From the simulation data it appears that, other things being equal, 

1. tTq ~ C (rr(rj')) or slightly greater appears to give the best performance 

2. A constant (or nearly so) yUe(T), which keeps the phase-space behavior time 
independent appears to give better performance than simulations for which /Xg 
strongly varies with r. 
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Figure 9.6: Dependence of observable estimate uncertainties on the starting occupa- 
tion no- Shown arc calculated uncertainties in Circles: n = (n) Squares: 5*-^^(0, r), for hxr = 1 
and /i = hx- Calculations were made with the radial gauge l|9.17|l varying the starting occupation 
no and choosing a constant fie according to H9.2|l . Each calculation was with 2 x 10^ trajectories. 



0.01 



o 

^0.005 

o 

< 




0.03 



0.02 



S 0.01 




Figure 9.7: Dependence of observable estimate uncertainties on the form of /ig. Shown 
are calculated uncertainties in Circles: n = (n) Squares: (7^^^(0,t), for hxr — 1 and /i = hx- 
Calculations were made with the radial gauge H9.17|l . starting with hq = 1 initially, but varying 
the form of /Xg. In (a), this was taken to be a nonzero constant for r < Tc and zero for t > Tc- In 
(b), /ie was zero for t < Tc, and nonzero constant for r > Tc- Each calculation was with 2 x 10^ 
trajectories. 
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9.5 Summary 

The drift gauges ()9.16|) and ()9.17|) have been developed to overcome shortcomings 
of the un-gauged positive-P-hke technique. Using these gauges it has been shown 
that a stochastic simulation can precisely simulate the full quantum grand canoni- 
cal thermodynamics of a two-body interacting Bose gas mode or orbital. Features 
include 

1. Simulation with good precision down to the lowest temperatures Hxt S> 1 
where discretization of mode occupation takes place (See e.g. Figures 19.31 
a,nd O|l . 

2. Simulation of a range of temperatures down to a minimum l/Zc^r^- in one run. 

3. Calculation of any desired set of observables from one run. 

The uncontrollable spiking (which was present in an un-gauged simulation) is re- 
moved. 

Numerical simulations indicate that the radial gauge ()9.17|1 gives superior effi- 
ciency for a wide range of parameters. Also, it has been found that if one is aiming 
for a particular target temperature and chemical potential, it appears advantageous 
to choose the intermediate-time chemical potentials in the simulation such that: 1) 
fie is slowly varying with r (or constant), and 2) The starting occupation no(A„) is 
of the same order as the mean occupation at the target time. 

The gauges developed should carry over in a straightforward manner to multi- 
mode simulations when the interparticle scattering is local to the lattice sites as 
described by the Hamiltonian ()2.17j) . This is implemented in Chapter 1111 
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Chapter 10 
Many-mode dynamics 

Part C of this thesis investigates the performance of the gauge P representation 
(including the special case of the positive P with = 0) for many-mode calcula- 
tions. Specifically, the interacting Bose gas. Chapter ^1 will consider simulations 
of many- mode dynamics, while Chapter ^2 concerns calculations of grand canonical 
ensembles. 

Regarding dynamics, it will be seen that an un-gauged simulation (equivalent 
to positive P) can already give many useful results, while under some conditions 
the diffusion gauges of Chapter [3 can be applied directly (i.e. without any addi- 
tional gauge improvements for kinetic coupling) to lengthen the simulation time and 
improve precision. 

The particular example systems simulated in this chapter do not have particle 
losses, which is a "worst case" in terms of simulation time and stability, as has been 
discussed in Section 17.2.41 This is also the case in which nonclassical phenomena are 
most pronounced. It is worth pointing out, however, that the gauge P representation 
method allows the addition of damping and/or losses in a completely straightforward 
way by adding some linear terms of the forms ()5.18|) . or 1)5.211) into the stochastic 
equations. Chapter ^2 on the other hand, considers simulations where the particle 
number strongly varies during the simulation. 

The stochastic wavefunction method of Carusotto et al [1^ and gauge P represen- 
tation methods share many common features, but the former is applicable only to 
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explicitly particle-conserving systems. That method has been used to calculate the 
evolution of the width of an atomic cloud with extended interactions in a similar 
breathing trap arrangement to the example calculation of Section [10.61 here. 



10.1 Simulation procedure 

To simulate dynamics with the gauge P representation, one starts from the equations 
()5.17p to ()5.21|) . For an M-mode system, there are then 2M + 1 complex stochastic 
differential equations. The Wiener stochastic increments were implemented using 
Gaussian noises of variance At, independent at each time step of length At. 

For efficiency reasons, it is convenient to use a split-step algorithm to evolve 
the kinetic energy part of the terms in Fourier space, while doing the rest of 
the evolution in position space. For notational convenience let us define the dual 
coherent state amplitudes (of the bra vector in the kernel ()5.4j) ) as 

< = (10.1) 

If the label z is allowed to stand for either a or a', then the (discrete) Fourier space 
variables are 

(27r)^/2 



V 



J]e-'^"""z„, (10.2) 



where V is the volume spanned by the spatial lattice. Expanding Unm as per ()2.13j) 
and ()2.14j] . the kinetic evolution in momentum space takes the form 

dan = —-^\^n\'^(^ndt + ... (10. 3a) 

da'~ = - ^Iknl"^ a'^ dt + ... (10.3b) 

after some algebra. (The remainder of the terms denoted by ". . . " will be evaluated 
in position space.) The split-step simulation algorithm is then, for each trajectory 
out of S: 

1. Initialize variables choosing randomly either from explicitly known initial con- 
ditions, or by passing variables from a trajectory in a previous calculation (e.g. 
a thermal grand canonical ensemble as will be calculated in Chapter II 1|). 
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2. For each time step At, 

(a) Transform variables to 5n, a'~ in Fourier space {n}. 

(b) Evolve variables forward by a time step At by applying the change p0.3p 
due to kinetic terms. 

(c) Accumulate any moments of variables in momentum space required for 
observable estimates. 

(d) Transform variables to «n normal space {n}. 

(e) Evolve variables (including weight Q) forward by a time step At by ap- 
plying the rest of the evolution due to interparticle collisions, particle 
gains or losses, and external potential. This last appears as diagonal lin- 
ear frequency terms, which now have the form Unn = V^^^/h after kinetic 
processes have been moved to Fourier space. 

(f) Accumulate any moments of variables in normal space required for ob- 
servable estimates. 

In the case when there is no particle gain from the environment (but losses to 
a zero temperature heat bath can be tolerated), the differential equations for all 
the variables a^, Q, take on an exponential and local form dzj oc Zj {j is here 
a generic variable label). It is then convenient to change to logarithmic variables 
log(Q;n), log(an), and zq = logQ. The radial and tangential evolution are then 
simulated separately, which is found to lead to superior numerical stability. This 
allows one to use larger time steps. Efficiency is improved despite the need to 
change back to the non-logarithmic amplitude variables an, a'n to make the Fourier 
transform. 

The integration algorithm used was a semi-implicit half-step iterative method, 
(k = i in the notation of ()B.10|) in Appendix EI)- The variable increments used 
{dan, dzQ etc.) are in the Stratonovich calculus. This algorithm has been found by 
Drummond and Mortimer |82j to have much superior stability to a plain first order 
method (although it is still first order in At). 
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During a simulation, several indicators were monitored to make sure no sampling 
biases occur. These indicators were 

1. Excessive variance in logarithmic variables. All or most of the evolution 
for the interacting Bose gas occurs in an exponential fashion as per dzj cx zj, 
when written in terms of variables that are averaged to obtain observable 
estimates. As described in Appendix bias may occur if the variance of 
log|zj| exceeds C(10). 

2. Sudden spiking, which may be an indicator of boundary term errors in the 
simulations with no drift gauge. This is described by Gilchrist et a/jH^- Some 
care had to be taken not to confuse this spiking with the far more benign 
kind caused by too-small ensemble sizes S, which occurs when evaluating 
quotients of variable moments (as described in Appendix [HI). A characteristic 
distinguishing feature between these is what happens to the time of first spiking 
when the ensemble size is changed. 

• Boundary term related spiking becomes more severe and/or first occurs 
at slightly earlier times as the number of trajectories S is increased. This 
is because as S increases, trajectories closer to the divergent moving 
singularity (and hence spiking sooner) become included in the ensemble. 

• Subensemble-size related spiking abates and/or first occurs at later times 
as the number of trajectories in an ensemble S is increased, due to better 
precision in evaluating the denominator in the quotient. 

3. Change in mean with ensemble size. A robust and widely applicable 
indicator of possible bias is when means of variables or of their functions 
change in a systematic manner (with statistical significance) as the ensemble 
size S is increased. 

On the whole, reasonably precise observable estimates were not brought into ques- 
tion by any of the above indicators, apart from some unusual situations. (An ex- 
ample of such a pathological simulation were thermodynamic calculations of ideal 
gases, where no noise occurs during the evolution. See Section 111.2.31 ) Typically, 
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worrying symptoms were seen only at times when noise had aheady obscured any 
observable estimates. 

10.2 Lattice size and simulation performance 

A lattice simulation will be equivalent to the continuum system provided that 1) 
the lattice encompasses the entire system (i.e. the volume spanned by the lattice is 
large enough) and 2) all relevant features are resolved (i.e. the lattice is fine enough). 
Once in this limit, all simulations will converge to the same physical predictions in 
the many trajectories limit S oo (provided there are no boundary term errors). 
However, the rate of convergence turns out to be affected by the size of the lattice 
spacing. This affects the useful simulation time. 

Before considering some example simulations in Sections 110.41 to fl0.7l the effect 
of lattice size on the simulation time is investigated. 

10.2.1 Scaling due to interparticle scattering 

To get a rough feel for the issue, let us see what happens in an un-gauged (positive 
P) simulation when the kinetic coupling is negligible, and the scattering process 
dominates. Consider the case of local rather than extended (f/n) interactions. Each 
lattice point represents an effective volume AV = Yl^ Ax^, and the mean occupation 
is rln ^ p(xn)A\^ for a system with density p(x). The self-interaction strength is 
X = g/2hAV from ^J^. 

At large mean lattice point occupations n, the simulation time for a single mode 
using the positive P is given from Table mi bv tsim ~ 1.27/x^^^^, based on nu- 
merical calculations. This is also in agreement with the analytic result ()7.85p . At 
low occupations the empirical fits of Table 17.21 can be used instead. Since single- 
mode simulation time decreases in absolute terms with mode occupation, then the 
expected simulation time is limited by the most highly occupied mode: 



Kinetic processes and external potentials have been ignored for the time being. 



2.5h{AV) 



1/3 




(10.4) 
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although at least the kinetic mode coupling will reduce the useful simulation time 
below this level, as was seen in the calculations of Chapter |H1 The expression ()l().4j) 
(or its extensions to include the empirical fits ()7.89p at low mode occupation) is 
qualitatively borne out in the simulations presented later in this chapter. 

Summarizing, finer lattices lead to shorter simulations, scaling as (AVY^^ 
in the strong interaction limit. 



10.2.2 Effect of kinetic interactions 

Let us now consider the effect of the kinetic interactions. The simplest case is a 
uniform coherent gas with density p, scattering strength g, and lattice point vol- 
ume AV. Also, no environment interactions, no external potential, and a constant 
standard diffusion gauge g" applied to each mode separately. 

When this is implemented as described in Chapter [3 and Section 14.4.21 (e.g. as 
in fl8.4j) ). the evolution of the amplitudes of a single spatial mode n is given (via 
dSini), and nn = „) by 



dan 
da' 



-i£^^'^ dt - iV^^'^an dt/h - 2ixan^n dt + dX^^'^ 
-iS'^'^ dt - iV^'^'a'n dt/h - 2ixa'nK dt + dX^^'\ 



(10.5a) 
;i0.5b) 



The quantities £n^ (with z representing either a or a') contain the kinetic interac- 
tions, and from (jKTTjl . ^^TV^ . and 

hAV 



2mV 



E7 |L-~ |2p*kii-(Xn-Xm) 



;io.6) 



where V is the entire lattice volume. The direct noise terms are explicitly 



%\p2^an 



cosh g" dWn + i sinh g" dW, 
—i sinh g" dWn — cosh g" dW^ 



[10.7a) 
;i0.7b) 



Let us write the Sn^ in terms of a mean S''"' and a fluctuating part 6Sn' of mean 



zero: 



sl:\t) = s^'^^)+5s^:\t). 



(10. 
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Similarly the solution of ()10.5|) will have a constant and fluctuating part: 

z4t)=z{t) + 5z4t). (10.9) 

Since the initial conditions are uniform and coherent then both z and S are the 
same for all modes and trajectories. Also, because there are no external forces on 
this system, 



z{t) = v^e*^(*) = ^/^e''^'^ (10.10) 

with some real phase 0{t). 

If (temporarily) the influence of the kinetic terms was ignored, then the analysis 
of the logarithmic variances of a single mode from Sections 17. 5. HI or 17.8.21 would 
apply, and the variance of log \ zn\ would be given by ()7.65p or ()7.92p . (Compare z^ to 
the deflnition of in ()7.5Hl ). In particular, at short times the dominant fluctuating 
contribution would be due directly to the noise terms rather than ampliflcation of 
rin fluctuations by the nonlinear drift term. From ()7.53|) and ()7.15|) . then 



log an{t) 
log<(t) 



^ logn - 2txm + { e-^" [tC{t) - C(t)] 



+ 



(lo.ii; 



where the Cn (^) time-correlated Gaussian random variables deflned as in ()7.56j) 
and ()7.57|1 . and are independent for each mode n. At short enough times that 
var [Re {logZn(^)}] ^ 1 , one would have 

z^{t) ^ z{t)[i + \ogzn{t)-teit)] ^^^^^^ 

where 6{t) = —2xnt, and Zn represents either or a'^. 

In reality there is also, a fluctuating contribution from the other modes mediated 
by the kinetic interactions so that the total short time fluctuations are 

Sz^it) ^ (5<^"^"*Zn(t) + 6^'''''''z^{t). (10.13) 

In the Ito calculus, where Wiener increments are uncorrelated with variables at the 
same time step, the direct and kinetic fluctuations at t are un-correlated, and so the 
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variance of fluctuations will be 

var [ \6zn{t)\] = var [ \6'^"''^' Zn{t)\] + var [ \6^'''''''^Zn{t)\] . (10.14) 

One expects that as long as the first term (due to the direct noise terms dXn"^) 
dominates the fluctuations, then the single-mode analysis of Chapter is accurate. 
In such a regime: 

• The diffusion gauge ()7.107|) (or ()7.94p for the case of = 0) will be well 
optimized. 

• The simulation times of Table 17.11 and 17.21 are accurate (at least for Qn = 
simulations — see Section HO. 2. 4p . 

• The diffusion gauges ()7.107|1 or ()7.94|1 will give improvements of simulation 
time while the mean mode occupation is high (i.e. n = pAV ^ 1 ). 

Let us estimate the relative size of these two fluctuation contributions at short 
times. Using p0.12|) and p0.11|) . the properties ()7.57|) of the one finds 

..r[\6'--z^it)\]^P-^^^^. (10.15) 

Also, directly from the equations p0.5|l and p0.8|l 

^kinetic^^(^) _ _^ /■ ssi'\t')dt'. (10.16) 
Jo 

Using ()l().6j) . and substituting in for Zm with the approximate (direct noise only) 
short time expression ~ ^ + 6'^^'^'^^Zm, one obtains 

2 rt pt 

rlf' I \^ /Adirect^* ^^// ^| ^direct ^ f^ry 

stoch 



var 



/ m,m',n,n' 

X |kjinkji'|^e-^''"'-(''"-^-')e*''"-(^"-^'"). (10.17) 

Since the direct noise at each lattice point in the locally-interacting model is inde- 
pendent, then 

/rdirect^* / .//\ rdircct ^ /./\\ r p(7 COSll 25f"min[t", t'] ( 1 r\ 1 Q\ 

{d Z^,{t )d 2m(t )),toch = Vm' ^ , (10.18) 
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similarly to ()10.15|1 . and the same for all modes. After performing the integrations 
over t' and t", and simplifying the Fourier transforms, one obtains 

var [ \6^-''^z^{t)\] = var [ ] -— — \^~^\'- (10-19) 



n 



For a D-dimensional system with many modes, one can approximate 



V [{2 



where ci(k™^^) is a shape factor (9 (1) that depends on the ratios between momentum 
cutoffs fc™^^ for the various lattice dimensions. For example in ID, Ci = 1, while 
for 2D and "iD when the momentum cutoffs in each dimension are equal, one has 
ci = 3|, and ci = 6|. Using ()10.20|1 . one obtains the final estimate 

var [ r-'-z^m ^ var [ ] gg^^^'^^'^^^ • (10-21) 

It can be seen that: the fluctuations due to kinetic terms 

• become relatively more important with time, and 

• are more dominant for fine lattices (i.e. when AV is small). 



10.2.3 Lattice size and diffusion gauge usefulness 

If the kinetic coupling remains weak (in relative terms) for the duration of a un- 
gauged or diffusion-gauged simulation, then one expects that the local diffusion 
gauges ()7.94j) are fairly well optimized, and will give simulation time improvements. 
Let us investigate under what conditions this is expected to occur based on p0.21|l . 

Firstly, simulation times were found in Chapter [3 to be significantly improved 
when the mean particle occupation per mode is rl > 1. The expected single-mode 
simulation time was found to be tsim ~ O (10) /xv^ — see Table 17.11 It will be 
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convenient to write the lattice interaction strength x = g/2hAV in terms of the 
healing length 

^heal ^ (10.22) 

This is the minimum length scale over which a local density inhomogeneity in a 
Bose condensate wavefunction can be in balance with the quantum pressure due to 
kinetic effects (e.g. it is the typical size of a quantized vortex in a BEC|42| 1^ . 
or the typical size of a density correlation disturbance, as will be seen here). The 
healing length is discussed in more detail e.g. in Dalfovo et a/[31j, p. 481. 

In terms of ^^^^^^ the diffusion gauged simulation time is tgim ~ O (40) {^^'^^^^my^ / h. 
From ()10.21|) . the kinetic fluctuations are expected to be weak (var [ \ 6^^^'^^^'^ z^^t) \ ] ^ 
var [ |(5*^''''^'^*Zn(i)| ] ) up to this time provided that 

The quantity (AVY^'^ = Ax is the geometric mean of the lattice spacing. One sees 
that simulation time improvement using a local gauge diffusion gauge occurs only 
when 

> ^^^''^n^/^ca 

> O(^^^^^i) (10.24) 



with C3 = 7r(3/80ci)^/^ ^ (9 (1) a weakly lattice shape dependent constant. (The 
second line follows since improvements occur only for ?T > 1.) 

That is, the local diffusion gauges ()7.107|1 or fl7.94j) can be expected to 
give simulation time improvements only if the lattice spacing is of the 
order of the healing length or greater. 

10.2.4 Drift gauges and many-mode dynamics 

As has been noted in Section 17.8. If drift gauged simulations additionally encounter 
a scaling problem in many-mode systems because the single log-weight variable Zq 
accumulates fluctuations from all modes. 
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One has the Ito equation for the log- weight: 

dzo = dWn - dt + dW^ - dt I , (10.25) 

jj k L J L ^ J 

with independent contributions from each mode. In a uniform gas on M = V/ AV 
modes each with occupation n, the contribution from each mode is identical on 
average, and the variance of real and imaginary parts of dz^ will scale as var [zq\ oc M. 
Because Zq appears as f2 = in observable estimates, there is a limit ()7.43p to how 
large the variance of Re {zq} can be if any precision is to be retained in the simulation 
(see Section E3j). At short times var [Re {^o}] tM, which leads to a reduction of 
simulation time tsim oc 1/M in this case. 

In the simulations with drift gauge ()7.105|1 the diffusion gauge g" acts to trade-off 
fluctuations in the amplitudes oc e^" with fluctuations in the log- weight oc \/Me~^" . 
This indicates that when many modes are present, a more advantageous trade- 
off might be achieved by increasing g" relative to the single-mode expression than 

Let us investigate this. With M identical modes in a uniform gas, the changes 
in the optimization of Section 17.5.31 can be easily tracked. The last term of ()7.60p 
for each mode is M times greater, and so the oc |nop term in ()7.65|1 also acquires 
a factor M. This factor also appears in the third-order (in Vg) term of the cubics 
()7.67p and ()7.72j) . In the end, the new optimization gives 

g';^ = hog [m\K{t)Wt,,^ + a^/'(n„(t),7„t,em)} (10.26) 

instead of the single-mode expression ()7.107j) . Simulation time can be estimated as 
in Section 171)1 At short times 

var [Gl] = ^ (^1/ + 1^ + 2M{xVgtn',f (10.27) 

(compare to ()7.80|) ). while for large particle number Vg ~ l/2{n''lxtMY^^ . One 
obtains 

.^.^^S^^oi"^^. (10.28) 



so simulation appears to be reduced by a factor of M as compared to the single- 
mode case (Much better than a 1/M reduction with the single- mode expression 

(EMI)). 



Section 10.3 Correlation functions 



253 



Unfortunately there is some further noise process that hmits simulation times 
with drift gauged dynamics simulations on large lattices. A preliminary trial with 
a uniform ID gas on a 50 point lattice with Axi = AV^ = 10^^^^', and n = 1000 
bosons per lattice point was tried, but numerical simulations with the gauge p0.26p 
did not show improvement to simulation times over the = case. (Nor did a 
combination of drift gauge ()7.3()j) with ()7.107|1 or g'^ = 0.) Only some improvement 
of precision at very small times <^ tgim was seen. Presumably kinetic effects lead 
to a large increase in zq fluctuations in this regime in comparison with what is 
expected for M uncoupled modes, and neither the original optimization not the 
above re-optimization of g'^ gives improvement. 

A combination of local drift and diffusion gauges was seen to give improvement 
under some conditions in Chapter|Hl There it was concluded that the kinetic coupling 
needs to be relatvely weak for this to occur, but details remain to be investigated. 
Simulations shown in the remainder of this chapter were made with zero drift gauge, 
and diffusion gauges only. 

10.3 Correlation functions 

In most experimentally realized Bose gas systems, low order local observables such 
as density or energy are often well described by approximate theories such as Gross- 
Pitaevskii (GP) equations (for a BEG), or statistical approaches in a high tem- 
perature gas. Multi-particle correlations are not well described by these theories, 
however, and so are of more interest for first-principles calculations. In terms of 
local lattice annihilation/creation operators those correlation functions that will be 
considered here are: 

1. First order: 



This is a phase-dependent correlation function. Its magnitude \g^^^\ tells one 
the degree of first-order spatial coherence, while its phase gives the relative 
phase of the wavefunction at spacing Xn — x^. 




(10.29) 
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2. Second order: 




(10.30) 



This two-particle number correlation function is not phase dependent, and 
always real positive. It describes the likelihood of finding two particles at a 
spacing — from each other, relative to what is expected of a coherent 
field. For a bunched field, g^'^\x,x) > 1 (e.g. a thermal state has g^'^\x,x) = 
2), while antibunching is evidenced by (7*^^^(x, x) < 1. 

3. Third order: 



This three-particle correlation function describes the likelihood of particles at 
Xn, Xm, and x^/ (ralative to a coherent field). The rate of three-body processes 
is proportional to ^'^'^■'(x, x, x) (or, if the effective radius of particles is r, three 
body processes will depend on values of ^''■^''(x, x + ei, x + 62), where \ej\ < r.) 
For example in a BEC, the rate of three-body recombination, which can limit 
the condensate's lifetime, is proportional to q^^Hx. x. x)[lHj. 

For the uniform gas, average values of these (over all space of M lattice points) are 
of relevance: 



9 




(anfln) (aLflm) {ajn'^^m' ) 



(10.31) 




(10.32a) 



m 



9 




Xm + y)- 



(10.32b) 



m 



10.4 Example 1: 



Correlation waves in a uniform gas 



10.4.1 The system 

This system consists of a uniform gas of bosons with density p and interparticle 
s-wave scattering length a^. The lattice is chosen with a spacing Axd ^ cts so that 



Section 10.4 Example 1: Correlation waves in a uniform gas 



255 



interparticle interactions are effectively local at each lattice point with effective field 
interaction strength g given by ()2.1fij) or ()2.19j) . depending on the dimensionality of 
the system. Coupling strength between lattice points, X; is then given by ()2.18|) . 
Periodic boundary conditions are assumed. The results obtained from a simulation 
will be invariant of lattice size provided the lattice spacing is fine enough to resolve 
all occupied momenta, and the lattice volume V is large enough to encompass all 
phenomena. 

The initial state is taken to be a coherent wavefunction, which is a stationary 
state of the ideal gas with no interparticle interactions (i.e. g = 0). Subsequent 
evolution is with g > 0, so that there is a disturbance at t = when interparticle 
interactions are rapidly turned on. Physically, this kind of disturbance can be cre- 
ated in a BEC by rapidly increasing the scattering length at t ~ by e.g. tuning 
the external magnetic field near a Feshbach resonance. 

The behavior of a uniform gas can tell us a lot about what goes on in more 
complicated inhomogeneous systems. If the density of a system is slowly varying 
on length scales of Lp or less, then any uniform gas phenomena of spatial extent 
Lp or smaller will also be present in the inhomogeneous system. In a trapped gas, 
for example, uniform gas phenomena that occur on length scales of the order of a 
fraction of the trap size or smaller. 

10.4.2 Correlation waves in a one-dimensional gas 

The evolution of some of the quantum correlations that appear in a one-dimensional 
gas with density p = 100/^'^*^'^' is shown in Figures llO.ll 110.21 and 110.41 based on 
Qn = simulations. Figure 110. HI also shows the momentum distribution of particles 
at times after being excited by the disturbance at t = 0. 
Time is given in units of the timescale 



(the "healing time"), which is approximately the time needed for the short-distance 
l^^hcai^^ inter-atomic correlations to equilibrate after the disturbance (see Fig- 
ure [in3K a)). 



k 





h 
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Figure 10.1: Evolution oig^'^\x) in a uniform ID gas with density p = 100/^''°'''. Shading 
indicates relative magnitude of g^^^(x), with light regions indicating high values > 1, and dark 
indicating low values < 1. The diagram is a composite of data from three simulations with 
different lattices: Ax = 0.12^h<=^\ 500 lattice points and S = 3000 for t G [0,t^] and \x\ £ 
[Q^29^hoai] ^ 0.24^'^'=^', 250 points and S = 10"* for t e m,3t^]. \x\ G [0,29^'''=^'], and 

Ax — S}^'^^^, 200 points and S = 10"* for the rest of the data. These changes in resolution are 
responsible for the apparent slight discontinuities in the data. The apparent weak cross-hashing 
superimposed on the plot at long times and/or large distances is due to finite sample size and/or 
lattice discretization effects and is not statistically significant. Magnitudes of g'^-* are shown 
in Figure 110.41 [arXiv note: higher resultion available at Piotr Deuar's homepage, currently 
http : / / www . physics . uq . edu . au/ people/deuar /thesis/] 
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Figure 10.2: Short time evolution of g^^\x) in a ID gas with density p = 
100/^^'*''^ From a simulation with <S = 3000 trajectories, 500 points and Aa; = 
0.12^'^'''*'. [arXiv note: higher resultion available at Piotr Deuar's homepage, currently 
http : //www . physics . uq . edu . au/people/deuar /thesis/] 
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Figure 10.3: Distribution of velocity in the uniform ID gas of spatial density p = lOO/^*"^**' 
at several times t after the initial disturbance. Plotted is the velocity density p per unit volume of 
the gas. Velocities v are given in units of v^Cbeo which is the speed of the strongest correlation 
wave. The background density of stationary atoms has boon omitted. Triple linos indicate error 
bars at one standard deviation significance. <S = 10** trajectories, Aa; = ^^'^'^^^ 100 lattice points. 
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Figure 10.4: Correlation functions in a uniform ID gas with density p = 100/^''''^'. solid 
lines show dashed lines show g'^', dotted lines show Ig^^-*]. Triple lines indicate error bars 
at one standard deviation significance. Subplot (a): Time dependence of local correlations, (b): 
Correlations at time t = t^. (c): Time dependence of correlations at distance x — 4^'i'=^i. (d): 
Long time correlations at i = 8^^. Data in (a), (b), and t < 1.25t^ in (c) are from a simulation 
with Ax — 0.24^^°^' and 250 points, while the rest is from a Ax = ^hcai simulation with 200 points. 
S — 10"* trajectories in both cases. 



Perhaps the most interesting feature noted in the simulations is the propagating 
wave train that appears in the two-particle correlations. These waves are seen only 
in the two- (or more) particle correlations, while the density always remains uniform 
(by symmetry, since there are no gains or losses in this model), and shows no sign 
of any wave behavior. This "correlation wave" is best seen in the time-dependent 
behavior of ^^^-"(x) in Figure fTO.ll 

Once some initial transient effects at times t < O (tg) after the disturbance die 
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out (these are shown in Figure Il0.2|l . the long time behavior of first and second 
order correlations appears as follows: 

• At distances O [C,^'^^^), antibunching occurs — See e.g. Figures HU.H 110.4T b). 

• Enhanced two-particle correlations occur at a quite well-defined interparticle 
spacing, which increases at a constant rate: ~ per t^. This is a factor 
a/2 faster than the low-momentum sound velocity in a BEC, obtained from 
the Bogoliubov dispersion law: 

Cbec = \fp9jrn. (10.34) 

See e.g. Dalfovo et al^ p. 481. 

• There are also many weaker correlations (and anti-correlations) at larger dis- 
tances that move apart at increasingly faster rates, as seen in Figure 110.11 
These leading correlation wavelets are particularly well visible in Figure lTO. 41 (b) 
The leading disturbances appear to move at ^ 20cbec for the p = IQQ/^^'^^^ 
system, although this is only clearly seen in this simulation for times < 1.5^^ 
when transient effects may still be significant. 

• Looking at the momentum spectrum of the particles fFigure [10.3^ . there are 
peaks, but their position changes with time (towards lower momenta), and 
in particular there is no peak corresponding to any particles travelling at the 
speed of the main (trailing) correlation wave. 

• The peak to peak width of these correlation waves is of the order of several 
healing lengths for the times simulated, although some slow spreading is seen. 

• At long times > O {t^) at distances shorter than the strongest trailing correla- 
tion wave with velocity v^Cbec, but longer than the healing length, long range 
second order coherence {g^'^\x) = 1) between particles reappears. 

• Long range phase coherence |^'-^^(x)| decays with time due to the scattering 
processes, (approximately linearly, at least while l^^^-*] ~ 0{1)). See e.g. the 
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shorter time behavior of Figure Il0.4lf c). and the long distance behavior of 
Figure EEKb) and (d). 

• However, in the (2nd order) coherent neighborhood that appears after the last 
correlation wave has passed, phase coherence decays only up to a certain value 
(see Figure IT0.4lf c^ after the last 'g^'^^ correlation wave has passed. This steady 
state phase coherence value drops off approximately linearly with distance, as 
seen especially in Figure ITIOlf d). 

• The third-order correlation g^^^ function also displays some wave behavior at 
similar times and distances as g^'^\x). This is visible e.g. in Figure IT0.4lf c). 

The uniform one- dimensional gas is peculiar in some respects, one of these being 
that as density increases, the system becomes more like an ideal gas |86t IH7]. From 
(jl0.4j) and pO.HHj) the expected simulation time in units of the healing time is 

tsim ~ 5n'/% (10.35) 

at high lattice point occupation n. To resolve the correlation waves (with width 
^heai-j^ Qj^g requires Ax ~ ^heai^ which gives the scaling 

That is, at lower densities, it is more difficult to observe the long time correlation 
wave behavior that evolves on the timescale. This is borne out in Figure IT0.6lf b). 
and is the reason why the relatively weak correlations in the p = 100/^'^''^' have been 
investigated here in most detail. 

Lower density simulations were also carried out for p = i/^^'^'^^ and p = 10/^'^'^'^^ 
Qualitatively the same kind of correlation wave behavior was seen for these densities 
up to simulation time, but the correlations are much stronger. Some examples for 
p = l/^'^^^' are shown in Figure fT0.5l 

When the length and time scales are given in density- dependent dimensionless 
units x/^^*^^^ and t/t^, the scaling of correlations in the simulated parameter regimes 
was closely approximated by 

theal 

gU) ^ S . (10.37) 
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Figure 10.5: Correlation functions in a uniform ID gas with density p — 1/^'^'^^'. solid 
lines show g^^\ dashed lines show g'^', dotted lines show Ig^^-*]. Triple lines indicate error bars 
at one standard deviation significance. Subplot (a): Time dependence of local correlations, (b): 
Correlations at time t — t^^. Simulations with Ax = 0.5^'^*^^^ 50 lattice points, and S — 10'* 
trajectories. 



10.4.3 Lattice dependence of simulation time 

The observed scaling of simulation time with lattice spacing Ax and lattice occu- 
pation n = pAx is shown in Figure 110.61 and compared to the expected un-gauged 
times when kinetic effects are ignored. The estimated times shown in gray are based 
on the empirical relationship ()7.89|) with the positive P (i.e. un-gauged) fitting pa- 
rameters given in Table 17^ It can be seen that for the physical regimes simulated, 
this empirical estimate of simulation time is very good also for many-mode simu- 
lations. (Most of the residual variation in tgim is due to statistical effects of finite 
sample size). 



10.4.4 Diffusion gauge dependence of simulation time 

From the analysis of Section 110.2.31 it is expected that diffusion gauges will give 
significant improvements once Ax > ^heai_ ^j^jg jg confirmed in Figure IT0.7t which 
shows the dependence of simulation time on topt for some simulations using the 
gauge ()7.94|) . At large lattice spacings, significant gains can be made with the local 
diffusion gauge, while for spacing of the order of the healing length the improvement 
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10"^ 10° 10^ 10"^ 10° 10^ 10^ 10^ 

^^/^heal n = pAx 

Figure 10.6: Scaling of useful simulation time with lattice spacing Ax and mean lattice point 
occupation n = pAx. Data points for simulations of ID gases of densities 100/^'^°*^', lO/^'^''^', and 
2^^hcai^ are shown with CIRCLES, TRIANGLES, and SQUARES, respectively. S = 10* trajectories. 
Also shown for comparison as light lines are expected simulation times based on the empirical 
relationship H7.89|l for the single-mode positive P. The dotted line in subplot (b) gives the 
high-occupation estimate (|10.35|l . 




Figure 10.7: Improvement of simulation time with diffusion gauges Simulation times as 
a function of the target time iopt when using the diffusion (only) gauge H7.94(l . All simulations are 
of a p = 100/^'^''*' gas. In subplot (a): Ax — ^^oai g — iQi trajectories, and in (b), and 
Ax = 10^'^'=^' with O (500) trajectories. 
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(a) p = 100/(^heal)2^ ^ ^ 



(b) p = t^t^^ 



~ 0.9996. 
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Figure 10.8: Correlation functions in 2D uniform gases at times 4t^ and for gases 
of density 100/(^'^°^')^ and respectively. Coordinates in dimension d are denoted 

Xd- Simulations used a lattice spacing of Axd = 30 x 30 lattice points, and S — 10^ 

trajectories. [arXiv note: higher resultion available at Piotr Deuar's homepage, currently 
http : //www .physics .uq. edu. au/people/deuar /thesis/] 



is small. Note the similarity of Figure lTO. 71( b) to the single-mode case Figure ITl^lf a). 
In both cases simulation time increases steadily while topt < max[tsim], but drops off 
sharply once the critical topt is reached (topt ~ 80t^ in this case). 

10.4.5 Two-dimensional gas 

Simulation of a two-dimensional gas is also straightforward using the equations 
()5.17j) . Some examples of calculated correlation functions for 2D uniform gases with 
densities p = 100/ (^'^^^')^ and p = l/(^'^°'^^)^ are shown in Figure ITO.81 Correlation 
wave behavior similar to the ID case is seen. 
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10.5 Example 2: 

Extended interactions in a uniform gas 

The simulation of a gas with extended interparticle interactions is also possible with 
the gauge P representation, as described in Section 15. 51 One uses the equations 
()5.17p but with scattering effects modified to become the nonlinear terms ()5.47|) and 
noise terms given by ()5.46|1 and ()5.45|1 . 
A Gaussian interparticle potential 

„2n 



U{x) = ^ exp 



— X 

24 



(10.38) 



is assumed for the interparticle potential in this example. The bulk interaction 
energy density in a system with density varying much slower in x than U (x) is m = 
^ jU{x- y)^\x)^^{y)^{x)^{y)dxdy ~ ^ / U{x)dx. The potential (ITIOH|l has 
been normalized so that u is the same as for a local-only "delta-function" interaction 
with strength g. 

The simulated second order correlation function is shown in Figures riU.9l and llO.10l 
for a potential of standard deviation au = 3^'^*''^'. Density was again chosen p = 
100/^'^'^'^^ so that at large length scales ^ au, the behavior of the gas approaches 
the behavior of the locally-interacting model shown in Section 110.4.21 and Fig- 
ures CUIIl 110121 andirm 

Correlation wave phenomena are seen in this system as well, but with significant 
differences to the locally interacting gas of Section 110.4.21 

• The antibunching at short distance scales occurs out to a range O (au) rather 
than ^^'^'^^^ and takes on approximately a Gaussian form. See Figure (TO. 101( a) 
and (b), and compare to the linear (in x) growth of ^'^^-'(x) in Figure ITO. 41( a). 

• The strongest positive correlation wave moves at approximately the speed of 
sound Cbec; not at the faster rates seen in the locally- interacting gas. 

• A train of weaker correlation waves is created (at least two more wavefronts 
are seen in Figure fT0.9|) . but these form behind the strongest leading wavefront 
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Figure 10.9: Evolution of g^^^'lx) in a uniform ID gas with extended interparticle inter- 
actions (|l().38(l : au = 3^''°'''. Gas of density p = 100/^'''''''. Simulation was on a lattice 
with spacing Ax = ^hcai^ 200 lattice points (thus with 20 000 total particles on average), and 
S — 10^ trajectories. [arXiv note: higher resultion available at Piotr Deuar's homepage, currently 
http : //www . physics . uq . edu . au/people/deuar/ thesis/] 




Figure 10.10: Correlations at t — 35t^ for the uniform ID gas with extended interparticle 
interactions. Gas and simulation parameters as in Figure HO. 91 Triple lines indicate error bars at 
one standard deviation. The form of the interparticle potential is shown to scale in (a). 
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rather than in front of it, and these secondary waves appear to also move at 
the sound velocity. 

• Some decaying oscillations of ^'-^•'(O) occur. 

Phase coherence fFigure flO. 101( c)) decays in similar fashion to the locally- interacting 
gas. 

The differences in the long time behavior between this system and the locally 
interacting gas may be reconciled despite the potential having the same coarse- 
grained interaction strength g. An assumption made to arrive at an effectively 
locally- interacting gas was that the s-wave scattering length = mg/Anh"^ is much 
smaller than all relevant length scales - in this case e.g. ^'^^^i. This is satisfied for 
the simulations in Section HO. 4. 2[ but the equivalent quantity here (which is au) is 
not smaller than 

As in the gas with only local interactions on the lattice, long time behavior is 
best observed with high density but small correlations as in Figure 110.91 Much 
stronger correlations can be simulated, but for shorter times (scaling will again be 

^sim OC n ^ . 

Lastly, a convenient feature of these extended interaction simulations is that the 
useful simulation time is found to be significantly longer than for a system with local 
interactions of the same coarse-grained strength g. For example in the p = 100/ C,^'^'^^ 
system, tgim ~ 46t^ for the au = 3^^°^' gas, while the locally-interacting gas had 
tsim ~ 17^5 (Note that g = h/2pt^). 

10.6 Example 3: Correlations in a trap 

Simulations in an external trapping potential V°^^ pose no particular problem. For 
example, the following system was simulated: 

• Bosons are prepared in a harmonic trap with trapping potential 

y^-t(x) = ]^muol^x^, (10.39) 
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which has a harmonic oscillator length a^o = \/hlmijj\^a- Initially they are 
in the coherent zero temperature ground state obtained by solving the Gross- 
Pitaevskii mean field equations^. The mean number of atoms in the trap in 
this example is = 10. 

• The bosons experience two-body interactions with an effective Gaussian inter- 
particle potential (jlO.HHj) with radius ou = aho, and a strength g = 0.4/iahoi-i^ho- 

• At t = 0, breathing of the atomic cloud is induced by switching to a more 
confined harmonic potential with double the trapping frequency: i.e. V^^^ — »• 

Some data from the simulation are shown in Figures 110.111 and 110.121 

This model is in a regime where qualitative results are hard to achieve using 

approximate methods because several length scales from different processes are of the 

same order: Trap width is Oho, initial cloud width is ~ 2aho, interparticle scattering 

range is also Oho. 

Phenomena seen in this the simulation include: 

• The two-particle density correlations g^'^\0,x) display different behaviors in 
the contracting and expanding phase. See Figure ITO.llIf a). 

— When the particle cloud is contracting, antzbunching appears at the cen- 
ter of the trap, while during the later part of the contraction there is an 
enhanced likelihood of pairs of atoms with one in the outer region of the 
cloud and one in the center. 

— During expansion, on the other hand, the particles tend to bunch in the 
center of the trap (i.e. there is increased likelihood of two particles at 
small separation), while pairs of particles with one in the tails, one in the 
center are suppressed. 

^i.e. the stationary state of (|5.17|l with noise terms removed. — see Section l5^ for more on the 
correspondence between the deterministic part of the gauge P equations and the Gross-Pitaevskii 
equations. 
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Figure 10.11: Contour plots of the evolution of correlations and boson density in 

the trapped Bose gas described in Section 110.61 Correlations shown are between bosons in the 
center of the trap, and those a distance x from the center, (a): Contours of density correlations 
g^'^^0,x) with a spacing of 0.01. Solid lines indicate g^^^ > 1, dashed g^^^ < 1. (b): contours 
of mean density p{x) with spacing of l/aho- (c): contours of phase coherence |(7^^-'(0, < 1 with 
spacing 0.01 (note: g^^^{0,0) = 1). (d): contours of the relative condensate phase Zg'-^^0,x) with 
spacing tt/IO. Solid lines indicate < Zg^^^ < tt, dashed — tt < Z.g^^^ < 0. Z.g^^^ = contour 
omitted. Light broad lines are contours of density at 5% of the central value, and indicate the 
approximate extent of the boson cloud. Simulation with S = 10^ trajectories on a 60 point lattice 
with Ax — 0.2aho. Note that aho is the harmonic oscillator length of the initial cloud, while aho/2 
is the width of the narrower trap at t > with frequency 2who. 
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Figure 10.12: Local two-particle correlations in the center of the trap for the breathing 
condensate described in Section 110.61 Triple lines indicate error bars at one standard deviation. 
Parameters as in Figure [Tu. Ill 

• This bunching during the expansion and antibunching during the contraction 
appear counter-intuitive. The reason for this appears to be a time lag before 
the dominant effect makes itself felt in the correlations. This results in the 
bunching trailing the contraction by a significant part of the breathing period. 
Initially, the cloud is stationary, and during the contraction phase, there is a 
long period of time when the atoms are laregly in free fall, and interparticle 
repulsion dominates the correlations causing antibunching despite the contrac- 
tion. Eventually, the atoms become squashed together leading to bunching, 
but this occurs only at the end of the contraction phase. Bunching now re- 
mains for a large part of the expansion phase before it is finally overcome by 
the interparticle repulsion. 

• The oscillations of g^'^\0,0) (i.e. bunching at the center of the trap) due to 
the breathing of the atomic cloud become more pronounced with time — see 
Figure 110.121 This may indicate a resonance between the breathing and the 
repulsion, although it is also possible that this is a transient initial effect. 

• Coherence between the center of the trap and outlying regions of the cloud 
deteriorates as time proceeds - compare the first and second contraction phase 
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10.7 Example 4: 

Bosonic enhancement of atoms scattered from 
colliding condensates 

10.7.1 The Vogels et al four wave-mixing experiment 

In a recent experiment of Vogels et al 01] at MIT, strong coherent four-wave mixing 
between components of a Bose-Einstein condensate at different velocities was ob- 
served. The matter wave components were created by applying Bragg pulses to an 
initially stationary trapped condensate so as to impart a velocity 2v"'^ to approxi- 
mately half of the atoms, (v'^™ is their velocity with respect to the center of mass). 
A small seed population ((9(1%) of all the atoms) moving at velocity was also 
created. Bose-enhanced scattering of the atoms during the half-collision of the two 
main wavepackets led to (9 (10 x) coherent amplification of the of the seed population 
and the creation of a fourth coherent population at velocity V4 = 2v™ — v,,. 

As the wavepackets move through each other, scattering of atom pairs occurs 
into velocities v and 2v'^™ — v, with energy conservation favoring |v| ^ v*^™. If 
the seed wavepacket is present at some jv^l ^ v'^™ then scattering of atoms into its 
modes is preferred due to Bose enhancement, analogously to stimulated emission in 
photonic systems. 

In the experiment the initially empty momentum modes also acquired sufficient 
density after some time, so that Bose enhancement of scattering occurred into these 
non-seed modes as well. These momentum modes (which are much more numerous 
than the seed wave) eventually competed with the seed wavepacket and limited its 
growth. 

A first-principles method could be desirable to quantitatively describe the effect 
of the initially empty modes in this system. Such a calculation is difficult with ap- 
proximate methods as both single-particle effects (to occupy the empty modes in the 
first place), and subsequent amplification of the coherent many-particle wavefunc- 
tion are involved. The first process can be estimated with perturbative methods, 
and the second with mean field GP equations, but combining the two has been 
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difficult. 

Experimental parameters (relevant to the subsequent simulation described in 
the next subsection) were : ^ 30 000 000 atoms of ^^Na {ag = 2.75nm, m = 
3.82 X 10~^^kg) in the initial wavepacket. This was created in an axially symmetric 
longitudinal trap with frequencies uj/27t of 20Hz in the longitudinal and 80Hz in the 
axial directions. This trap is subsequently turned off, and the two main wavepackets 
created with the Bragg pulse move at a velocity iv'^™ relative to the center of mass, 
with v'^™ ^ lOmm/s. 

10.7.2 Simulating Bose-enhanced scattering into initially 
empty modes 

To study the enhanced scattering into the initially empty modes, an un-gauged 
positive P representation simulation was made with similar parameters as the Vogels 
et al experiment. There were two differences: 

• In the simulation there were 1.5 x 10^ atoms (on average), while there were 
~ 3 X 10^ atoms in the experiment. This smaller atom number was needed to 
achieve a long enough simulation time to see significant Bose enhancement of 
the scattered modes. 

• In the simulation no seed wave was placed. 

The simulation was carried out in the center-of-mass frame, and the initial coherent 
wavefunction was taken to be 

V'(x) = V'Gp(x)y2cos ^ "•^^'^-^ ^ ^ (10.40) 

where '?/'gp(x) is the coherent T = ground state solution of the Gross-Pitaevskii 
(GP) equation in a trap of the same dimensions as in the experiment, and the 
full number of atoms. The Bragg pulse used to impart velocity to the moving 
wavepacket was of short duration ^ 40/is, and so to a good approximation no 
significant evolution of the condensate took place during the pulse, hence ()10.40|) is 
a good approximation to the state of the system just after the pulse. The initial 
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Fi gure 10.13; Total number of scattered atoms TV^'^^*^ and (b): rate of scattering in the 

colliding BEC system of Section 110.71 solid line: scattered atoms calculated as described in 
Section 110.7.21 from a positive P simulation. Triple lines indicate uncertainty at one standard 
deviation, dashed line: Estimate obtained with the imaginary scattering length method 

wavepackets move in the longitudinal direction iv'^™ = [±v^™, 0, 0]. The simulation 
was on a 438 x 48 x 48 lattice with S = 200 trajectories, and lattice spacing of 
Axi = 0.2209/im in the longitudinal, and Ax2,3 = 0.4921/im in the axial directions. 
Simulation time is O (6days) on a PC of 2002 vintage. 

Figure ITO . 1 31 shows the change of scattering rate with time, and the total number 
of scattered atoms for this simulation. Since the overlap between the two wavepack- 
ets travelling at iv*^™ decreases as they move apart, and the rate of scattering into 
empty modes is proportional to this overlap, then the increasing scattering rate in 
the simulation is clear evidence of the beginning of Bose enhancement of 
the initially empty modes. 

An interesting technical difficulty occurs when estimating the number of scat- 
tered atoms. Just as in a real experiment, the atoms in the first-principles calculation 
are not labeled as "scattered" or "non-scattered" . Some of them are scattered back 
into momentum modes already occupied by the initial wavepackets, and cannot be 
separated from the unscattered atoms by counting. To nevertheless make an esti- 
mate of scattering rate, a similar procedure was used to what would be needed for 
experimental data. It was aimed to count only those atoms with momenta beyond 
the coherent wavepackets. Explicitly, momentum modes with longitudinal veloci- 
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ties differing by less than Imm/s from ±y™, and witli radial velocities of less than 
2.3mm/s in both orthogonal directions are excluded from the count. (For compari- 
son, the rms velocity deviation of atoms from iv'^™ = [±v^™, 0, 0] in the condensates 
is initially about 0.13mm/s and 0.5mm/s in the longitudinal and radial directions, 
although some later spreading occurs). This gives a scattered atom count A''^'^'^*. 

Furthermore, in a mean field GP equation calculation, there appear some atoms 
at momenta that would be counted as "scattered" with the above counting method. 
(These occur mainly in the far tails of the wavepacket momentum distribution due 
to spreading with time, and also at velocities ±3v^™ due to the stimulation of a weak 
scattering process v &v — > 3v & — v when v ^ iv'^™ to a state with short lifetime 
(due to lack of energy conservation). Initially their number is (9(200) initially, 
growing to (9(3000) at the end of the simulation). These GP background atoms, 
which are not due to the spontaneous scattering process of interest or its Bose 
enhancement, are subtracted away from the A'^^'^^* count obtained previously to arrive 
at the final scattered atoms estimate N^'^''^^ shown in Figure ITO.131 

To check how well the expression ()10.4p assesses simulation time a priori, let 
us see what estimate it gives in this more complex system. The peak density 
in the middle of the initial cloud in the Thomas-Fermi approximation is = 
fyjj\io{lhNas/ ciYioY^^ l^g. The maximum density is well approximated by twice this 
Thomas-Fermi peak density (the factor of two arises because the initial condi- 
tion p0.4()j) has local density peaks with twice the local average density). Using 
max[p(x)] = 2pg^ in ()10.4|) . one obtains an expected simulation time of 0.6ms, 
which agrees fairly well with the observed simulation time of 0.78ms. The extra 
simulation time as compared to the estimate may be due to the decrease of peak 
density with time as the wavepackets move apart. 

Lastly, the local diffusion gauges ()7.94|) and ()7.107p of Chapter [7| were not ex- 
pected (nor found) to give significant simulation time improvements for this calcula- 
tion, because the healing length is larger than the lattice spacing. (Using pg ^ again, 
one obtains ^^^^"^ ~ 0.6pm. The small lattice size used is needed to resolve the phase 
oscillations in the moving condensate. 
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10.7.3 Comparison to imaginary scattering length estimate 

The first-principles scattering rate is also compared to that obtained using the imag- 
inary scattering length technique |88j (see Figure IT0.13p . This approximate method 
applies to colliding BECs when (among other conditions) their relative velocity is 
much larger than the momentum spread in a single wavepacket. (Here, the packet 
velocity relative to the center of mass is ~ lOmm/s in the longitudinal direction, 
while the initial velocity spread in this direction is ~ 0.13mm/s rms). Scattering 
losses from the condensate wavefunction to empty modes are estimated from the 
GP equations by making the replacement 



in the (now complex) scattering strength, which leads to particle loss form the 
wavefunction. 

In Figure 110.131 one immediately sees that the scattering rate is significantly 
overestimated with this method. This difference is due to suppression of scattering 
to momenta lying close to the primary direction of motion iv'^™, which is not taken 
into account in the imaginary scattering length approximation. 

In Figure ITU.141 marginals of the (positive P) simulated momentum distribution 
are plotted, and in particular, density of the velocity component vi in the longi- 
tudinal direction is shown in subplot (c). One sees that the density of scattered 
atoms decreases (slowly) as |vi| v^™, whereas if the scattering was to an isotropic 
spherical shell of momenta, this linear density would rise towards v'j™. 

This suppression of scattering has been predicted by Bach et a/jSHI when con- 
sidering the scattering from two plane waves under the Bogoliubov approximation. 
They found that the suppression of scattering was dependent on the ratio r^; of ki- 
netic single-particle energy due to the plane wave motion with respect to the center 
of mass, and the interaction energy per particle. This ratio was 



a'g = as{l — ^Iklas) 



(10.41) 




(10.42) 



(10.43) 
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Figure 10.14: Velocity distribution of atoms at a time t = 0.783ms after the end of the Bragg 
pulse that separates the two condensate wavepackets. Subplot (a) shows the combined distribution 
of longitudinal velocity vi and velocity in one axial direction V2, after summing over all V3 values. 
Density of atoms grows as shading darkens, with no shading corresponding to vacuum. The 
high spatial frequency noise is due finite sample {S — 200) uncertainty rather than actual atoms. 
Subplot (b) shows the atom distribution in the axial directions, with summing over all longitudinal 
velocity values vi. Subplot (c) shows the distribution of longitudinal atom velocity p(vi). The noise 
is due to finite sample effects, and shows the degree of uncertainty in the calculated values. Data are 
from the positive P simulation described in Section [10.7.21 [arXiv note: higher resultion available 
at Piotr Deuar's homepage, currently http://www.physics.uq.edu.au/people/deuar/thesis/] 
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where p was the spatial density of each plane wave component. The suppression 
of scattering in the iv"^™ directions becomes less significant at large te, but is still 
strong when rE ^ O (10). For the system simulated here, te can be estimated by 
using as an estimate for p in ()10.43p . and gives te ^ 5, well in the regime where 
scattering along the direction iv'^™ direction is suppressed. 

A reduction of the magnitude of scattered particle momenta from the condensate 
values was also predicted in Ref. jHS] and is also seen in the simulation here (Note 
how the circle of scattered atoms in Figure IT0.14lf c^ lies at a slightly smaller radius 
in momentum space than the condensates.) 

10.8 Summary 

The above examples demonstrate that qualitative predictions of dynamics can be 
made with the gauge P representation (or the special case of the positive P represen- 
tation) for a wide range of many-mode interacting Bose gas systems. This includes 
predictions for spatial correlation functions (including non-local and/or high order) 
as well as local observables such as densities. Processes simulated included interac- 
tion with external trap potentials and two-body scattering under the influence of an 
extended interparticle potential, or of one acting only locally at each lattice point as 
in a Bose-Hubbard model. Spatial and momentum densities and their fluctuations 
have also been previously calculated for a lossy system with local interactions by 
Corney and Drummond |10l ^| using the un-gauged method. There, the onset of 
condensation in an evaporative cooling simulation was seen. 

The examples in this chapter show that such first-principles simulations can be 
tractable even with very large numbers of modes or particles in the system given the 
right conditions. This is evidenced by example 4 above, where there were 1009 152 
lattice points and on average 150 000 atoms. 

Situations where there are several length scales of similar order, or processes of 
similar strength are of particular interest for first-principles simulations because it 
is difficult to make accurate quantitative predictions otherwise. The examples of 
Sections 110.61 and 110.71 shown that predictions for systems in such regimes can be 
made with the present method. 
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Improvement of simulation times by use of local diffusion gauges was seen in 
Section 110.41 This occurs when the lattice spacing is of the order of the (local) 
healing length or greater. This confirms what was expected from the analysis of 
Section COTl 

In Section 110.2.11 an analytic estimate of useful simulation time with the un- 
gauged positive P method was derived. This was subsequently seen to be quite 
accurate both in the uniform gas with local interactions, and in the more complex 
colliding condensates system of Section 110.71 The simulation time is seen to scale 
as (AVy^^ with the effective volume of a lattice point AV^, so that coarse-grained 
simulations last longer. There is a tradeoff between a fine enough lattice to include 
all relevant processes, and one coarse enough for the simulation to last long enough 
to see the desired phenomena. The analysis of Section 110.2.21 also shows that ki- 
netic coupling between spatial modes (which can also reduce simulation time) is 
more dominant for fine lattices. Simulations with Gaussian extended interparticle 
interactions ()10.38|) were seen in Sections 110.51 and 110. 61 to last at least several times 
longer than those with the same bulk interaction energy density. 

Some of the phenomena predicted by the example simulations include: 

• The propagation of correlation disturbances ( "correlation waves" ) in a coher- 
ent gas without any corresponding density waves, and at a velocity ~ \/2 times 
faster than the speed of sound. (Section Il0.4|) 

• The complex interaction between the breathing motion of a condensate in a 
trap and the interparticle correlations in the condensate. (Section 110. 6|) 

• Bosonic enhancement of initially empty momentum modes during the collision 
of two condensates. (Section Il0.7j) 

• Suppression of spontaneous scattering processes in these colliding condensates 
in the direction of motion. (Section llO. 7. H|l 
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Chapter 11 

Thermodynamics of a 
one-dimensional Bose gas 

The one- dimensional (ID) regime has been recently observed in trapped Bose gases|HSl 
EHIini; giving strong impetus to theoretical studies of such systems. This chapter 
describes exact predictions of spatial correlation functions and momentum densities, 
which have been obtained for ID uniform Bose gases in a grand canonical ensemble 
by using gauge P simulations. 

A part of these results have been recently published [HT|. 

11.1 One-dimensional uniform Bose gas at finite 
temperature 

11.1.1 Exact solutions 

The interacting uniform ID Bose gas model described by the Hamiltonian 

H= — / — + - / ^^^(x)^^(x)dx (11.1) 
2mjdxdx 2 J ^ ' ^ ' ^ ' 

was found to be exactly solvable several decades ago, and is one of the few exactly 
solvable nontrivial many-body problems jHOj. The solutions were found in the pio- 
neering works of Girardeau inn 1^2] at (7 — > oo, Lieb and Liniger jHSl HH] at T = 0, 
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and by Yang and Yang [87] for the grand canonical ensemble at finite temperature 
T > 0. See also 11103 EE! for reviews. 

Explicitly, these solutions provide a numerical algorithm to calculate a density 
of states pk and holes ph- These can then be used to obtain some intensive physical 
quantities of the system such as density, energy density, and pressure. Recently 
there have been some further advances, especially in the zero temperature and strong 
interactions limit. These include T = expressions for the tails of the momentum 
distribution and short range ^'•^^(x) jHZI; T = expressions for g^^\0), g^^\0,0), and 

(a;) [HH] ; and finite T values of ^(^''(0)i99j. These last finite temperature ^'•^^(0) 
solutions indicate that there is a rich variety of correlation phenomena in the T > 
regime. 

Still, exact results for the vast majority of observables remain hitherto inaccessi- 
ble because they cannot be obtained from the density of states and holes (or perhaps, 
a way to obtain them has not yet been found). This includes such basic observables 
as spatial correlations and even momentum distributions for T > 0. 

The Hamiltonian (jll.lj) is a special case of ()2.H) . and as such, its grand canonical 
thermodynamics can be simulated using the gauge P equations ()5.50|) in a wide range 
of physical regimes. This has allowed the calculation of exact spatial correlation 
functions and momentum distributions for this model here — see Section 111. 41 

Noteworthy in this context are also the stochastic wavefunction simulations of 
Carusotto and CastinjHH], who calculated spatial correlation functions and 
g^^\x) of a different but related model: The canonical ensemble (i.e. with set particle 
number, as with all stochastic wavefunction calculations) of a uniform interacting 
gas in a system of small spatial extent where finite size effects were important. 

11.1.2 Correspondence with trapped gases 

Under present day experimental conditions, the Bose gas confinement is well ap- 
proximated by a harmonic trap of transverse angular frequency uj± and longitudinal 
frequency c^o- The correspondence between the uniform gas model, and a trapped 
gas experiment can be summarized in the following points: 
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The atoms behave effectively as a ID gas provided that the transverse zero 
point oscillations 



h = \l^ (11.2) 

are much smaller than other characteristic length scales of the system such as 
the healing length pi.9j) and the thermal de Broglie wavelength of excitations 
()11.8p . This regime has been reached in recent experiments [3H1 EH I 

On length scales smaller than the trap size 



^</o = a/-^ (11.3) 

the density of the gas is slowly varying. On this and smaller scales the ID 
trapped gas and uniform gas will show the same physical behavior, and so 
the results obtained in Section 111.41 will be directly applicable to a trapped 
gas while x <^ Iq. It is worth noting that if, instead, details of the trap are 
important, a gauge P simulation with the external potential V^^^ explicitly 
specified can be used instead. A dynamical example of this was given in 
Section 110.61 

• The field interaction strength g is related to the 3D scattering length as by 
the expression ()2.19|) . where the effective trap cross-section in the transverse 
direction ispjJOj 

o 2Trh 

Ao = 27r/i = . 11.4 

11.1.3 Uniform gas regimes 

The properties of a the finite temperature uniform ID gas with the Hamiltonian 
(jll.lj) depend on two dimensionless parameters: The first is the coupling param- 
eter^ 

^ = w ^''-'^ 

^Not to be confused with the unrelated single-boson loss rate 7 of Chapters El and H 7 is used 
here in Chapter ^] for the quantity IjlLSfl as this is the standard notation in the literature. 
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dependent on interaction strength g and density p. The ideal Bose gas is reached as 
7 — > 0, while the Tonks- Girardeau (TG) hard sphere gas occurs in the limit 7 — > 00. 
In this limit the particles undergo an effective "fermionization" |9H 1^ IH^l I93j. 
The second dimensionless parameter is the reduced temperature 

~ T mkeT 

^=7^ = 2;^' ^'^-'^ 
where Td is the quantum degeneracy temperature. For T < T^, the gas is denser than 



the quantum concentration uq = ^JmksT /2T{h? and the average spacing between 
particles 1/p is less than the thermal de Broglie wavelength ()11.8|) . 
Several characteristic length scales in the uniform g 



Mean interparticle spacing 



/p = ^. (11.7) 



Thermal de Broglie wavelength 



, 2TTh? If, 

\t=\ —— = ^. 11.8 

Atoms with typical energy behave as particles on length scales 3> At, as waves 
on <^ At 

Healing length 

e^"^' = = (11-9) 

which is the length scale of typical spatial density fluctuations. 

Degenerate phase coherence length 

/, = ^ = i, (11.10) 
2tt 2-nT 

which applies when T <^ 1 and 7 <^ 1. When T ^ 1, the phase coherence 
length is Ip = Xt/ \/2ti. 



282 



Chapter 11 Thermodynamics of a one-dimensional Bose gas 



• ID scattering length [T 

fi^ I 

aiD = — = ^ (11.11) 

mg 7 

is the characteristic distance over which the influence of a single particle is 
felt. In the fermionized large 7 regime, this is the effective size of the hard 
sphere due to a single particle. 

The ID gas behaves rather differently than the 2D and 3D cases, in that, for 
example, in a strong coupling regime the density is low. Also, a true BEC does not 
form, because long range order on length scales ^ is destroyed. This follows from 
the Bogoliubov theorem, as explained in |1()H 1102] . Nevertheless, at degenerate 
temperatures T < 1 and relatively weak couplings -C ^^^^^ and 7 <^ 1, a finite- 
range quasicondensate with phase coherence exists. 



11.2 Simulation details 
11.2.1 Overview 

Integration of the gauge P stochastic differential equations fl5.50|) proceeds in a sim- 
ilar fashion to the dynamics calculations of Chapter ^| These are based on the 
master equation formulation of the grand canonical ensemble, as described in Sec- 
tion EIEl A split-step algorithm is used, as described in Section llO.ll with integration 
using the semi-implicit method described in AppendixEl To exclude potential finite- 
sample bias of the kind described in Appendix El the variance of the logarithms of 
quantities to be averaged for observable estimates was also tracked. 

To simulate the uniform g needed to be taken to avoid finite size effects. 

This can usually be achieved for all practical purposes by choosing a lattice spacing 
several times smaller than the smallest length scale of Section [11.1.31 ^ind a number 
of lattice points large enough so that the total spatial extent is several times larger 
than the largest length scale. Operationally, the lattice resolution was increased in 
normal and/or momentum space until no changes with lattice resolution were seen 
in the observables of interest. 
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There are several further issues that arise only in the thermodynamics calcula- 
tions and require more detailed discussion: 

11.2.2 Drift Gauge 

As was discussed in more detail in Section l9.2.H the un-gauged positive P equations 
are not suitable for simulation because of severe moving singularities as Re {anPn} 
takes on negative values. This behavior appears in the many-mode well, 
and leads to rapid onset of spiking, as well as possible systematic errors after only 
a short simulation time. 

The radial drift gauge ()9.17|) acting locally on each mode was used to remove the 
moving singularities in the same manner as for the single- mode system of Chapter El 
Explicitly, when adapted to the many-mode situation this gauge is 

e?n = ZV^K/3n- |an/3n|), (11.12) 

where x is given by ()2.18|1 . as usual. 

Spiking abates, and boundary term errors are not seen. 

11.2.3 Importance sampling 

The simulated equations include evolution of both amplitudes and weight 

Q = e^°. Deterministic evolution of the weight is a new feature compared to dy- 
namics and can cause sampling problems at nonzero times r > 0. 

It is convenient to initialize in Fourier space. Here the variables 5h(t) and 5~(r) 
are given by ()l().2j) . and bold quantities indicate vectors with M elements — one per 
mode. For an M-mode system of length L, the standard initial distribution ()5.59p 
becomes in Fourier space 

Pcia, a', zo) = 6\zo)6''\d,' - 5) TT ^ exp (^t^) , (11.13) 

^J- TiriQ \ no J 

where no = L'^nQ/2TrM is the mean of all |5nP at r = 0. The lattice factors arise 
because the number of particles per Fourier mode is oc 2^^- A sample is generated 
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by the simple procedure 

an = \/%Vn = (11.14a) 

5^ = (11.14b) 

zo = 0, (11.14c) 

where the rj^ are independent complex Gaussian noises of variance unity. 

The problem is that this is a good sample of the r = distribution, but not nec- 
essarily so at later times. This is so even for an ideal gas when two-body interactions 
are absent. In the simplest case of the ideal gas with no external potential = 0, 
the equations of motion ()5.5()|1 can be solved exactly in Fourier space, which gives 

(11.15a) 
(11.15b) 

„„^.^E{|5HMr-|4°^r}, (11-15C) 

n 

where 

hn{r) = ^(r) - ^ (11.16) 

is the Gibbs factor exponent. 

For the ideal gas, the exact finite temperature solution consists of an independent 
thermal state with mean occupation for each Fourier mode. An unbiased sample 
of this is 



5fi( 




= 5f e'^- 


a'ni 


V) 


= 5i°) 




V) 


27rM 



5kW = 5fi(r) (11.17b) 
zo = 0, (11.17c) 

where for the ideal Bose gas, the mode occupations obey the Bose-Einstein distri- 
bution 

Mr)=nr{r) = ^_J^^_^ . (11.18) 

The actual simulated 5n(T) in pi.l5aj) are not necessarily anywhere near pi.l7aj) 
with Bose-Einstein occupations ()11.18|) . so the nonzero variable weight ()11.15c|) is 
needed to compensate for this. 
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This is fine in the hmit of infinite samples 5 — oo, but in practice the distribution 
of the weight Q = can be badly sampled once the variance in zq becomes too 
large. Since |5~°'*p is a Gaussian, then zq also will have a similar nature and the 
usual limit ()7.43|) for exponentials of a Gaussian applies: The weights (and hence, 
observable estimators, which all average quantities oc e^°) are well sampled only 
while 

var[Re{^o}] < 0(10). (11.19) 

A method known as importance sampling offers a way to alleviate this weight- 
ing problem if one is primarily interested in the gas around a particular target time 
tt (i.e. temperature l/ksT and chemical potential ^{jt))- The importance sampling 
approach is widely used for weighted stochastic integration, and a comprehensive 
discussion of this for general situations can be found in |lU3j . Section 7.8. In the 
case considered here, the idea is as follows: 

In the gauge P representation, the density matrix is written as 

p„(r) = j Pcir, z, zo)A{z, zo) d^^ z £zo, (11.20) 

where z stands for all the phase-space variables 5, 5'. The simulation here uses 
standard drift gauges, so that the evolution of the weight zq is actually determin- 
istically dependent on the evolution of the z, as given in ()4.43|1 . In the initial 
distribution ()11.13|) . zq = is independent of z. Together, these properties allow 
the z and Zq evolutions to be separated. Explicitly, 

^o{r,z) = " dr', (11.21) 

where dzQ is given by ()4.43p . After defining A{z, zq) = e^"A(2;), this allows us to 
rewrite ()11.20|) as 



Pu[r) 



£G(^,2)A(2)e"o("'^) (11.22) 



where the underlined quantities depend only on the amplitude variables z. 

The multiplying factor of the operator kernel A in ()11.22|) is P_gg^°j and in 
fact there is a whole range of possible initial distributions provided that this factor 
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remains the same. In particular we could start with a different initial distribution 
of the amplitude variables P_go{0,z) provided that this is compensated for in the 
initial weights by 

dzoir'^zir')) 



dr' 



dr' + log PGiO,z)- log Pgo{0,z). 



(11.23) 



Then (and this is the crucial point), if we could choose some initial distribution 
P_Go that leads to only a small spread in zq at target time tt, then the simulation 
would be well sampled there. (It would also be sampled much worse at r = 0, but 
this doesn't bother us). This then is the sampling according to importance (i.e. 
according to the situation at tt)- 

For the purposes of the simulations reported in this chapter, a fairly crude yet ef- 
fective importance sampling was applied: At relatively weak coupling, a very rough 
but useful estimate of the state is that the Fourier modes are uncoupled, and ther- 
mally distributed with mean occupations nji(rr). The equal- weight samples ()11.17|) 
correspond to the distribution 



27cM 
L2 



;il.24) 



with appropriate normalization. This is not yet quite what is wanted because the 
desired sampling distribution P_gq is to be at r = 0. An estimate of the initial 
distribution that leads to P_'g'{t^z) can be obtained by evolving ()11.24|1 back in 
time with only kinetic interactions. This is again rather rough, since deterministic 
interaction terms oc g have been omitted (not to mention noise), but is simple 
to carry out, and was found to be sufficient for the purposes of the preliminary 
calculations presented here. One obtains the sampling distribution 



Pgo(0, z) oc 5^^\a' - a) exp 



2tiM srp \0in 



L2 



(0) 



;il.25) 



where 



nf^ = rinirT) exp [-A„ - /iji(rr)rT] 



11.26) 



(Note that limT-^o[^n(T)T] = — A„). Using ()11.23|) . the initial variables are then 
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sampled according to 



5h(0) 




(11.27a) 



5^(0) 



5ji(0) 



(11.27b) 



^o(O) 



2ttM 



n 




(11.27c) 



For most of the simulations of Section ril.4| the target Fourier space distribution 
^n(TT) was just taken to be the plain ideal gas Bose- Einstein distribution ()11.18|1 . 
However, once the interactions become significant, this is not satisfactory because 
the actual interacting gas momentum distribution is far from the Bose-Einstein form 
(See e.g. Figure nXH|) . A better choice of ?Tfcj^(rr) is the density of states function 
Pk of the exact Yang & Yang solution[SZ|- In practice, to simplify the calculation, 
an estimate of pk was used instead. This was obtained by making a least squares 
(unweighted) fit of parameters Cest and cxest in the expression 



to Pk- This form was chosen because in the tails of the distribution in Fourier space 
when k'^TT ^ 1, n^~^ ^ no, and so the initial weights -2o(0) in ()11.27c|l are not 
dependent on the momentum cutoff. 

11.2.4 Momentum cutoff 

In a dynamical simulation, one requires that all occupied momentum modes are 
contained within the momentum cutoff k^^^ = ir/Axd- Strictly speaking, this cannot 
ever be satisfied in these thermodynamic simulations because the T — > oo ideal gas 
grand canonical ensemble is used as a starting condition. This has all momentum 
modes occupied up to infinite energy. This turns out not to be a practical problem 
because the precision of calculated results is limited anyway by the finite sample size 
S. It is then sufficient to increase the fc™^^ until all observable estimates calculated 
are invariant with k^^^ to the precision achievable with a given ensemble size. This 
was carried out. 



rih = not 




est 
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11.2.5 Chemical potential at intermediate temperatures 

As was noted in Section 19.41 if one is primarily interested in the behavior of the 
system at (or around) a given temperature tt, and chemical potential fi, then the 
values of fi at intermediate times t < tt can in principle be chosen at will. 

In practice, some choices lead to better precision per ensemble size than others. 
The analysis in Section l^Ol indicated that the single-mode simulations led to the best 
precision when the "effective" chemical potential /ig = was roughly constant 
during the simulation. It was assumed that a similar dependence occurs in the 
many-mode system, and so, accordingly a constant /Xg was chosen so that at tt the 
chemical potential becomes ^i{tt). Using ()9.2|1 . the constant /ie assumption leads to 

/^e = /^(tt) + — . (11.29) 
Tt 

It remains to choose the initial density po = MnQ{Xn)/L (with initial occupation 
uq given by ()2.33|) ). It was found that for target densities p{tt) a choice of initial 
density 

P« = o(^) (11.30) 

gave the best precision at tt in most cases. Details varied depending on system 
parameters, although a clear tradeoff between two noise generating processes was 
seen: 

1. When Po was too large, a lot of added randomness is introduced into Zq at 
early times, and precision is lost very rapidly once var[Re{2;o}] ^ C^(IO), 
as discussed in Section 17.41 and Appendix El This randomness arises largely 
independently of gauges or noise since the spread of in the initial thermal 
state is proportional to ^/po, and this then leads directly to a spread in the 
deterministic evolution terms of dzQ. 

2. When po is too small, excessive spread in the amplitude variables arises as well 
(which also then feeds into the log-weight ^o)- 
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11.2.6 Scaling of weight variance 

For uniform gas calculations it was found to be desirable to use the smallest system 
size that does not introduce finite size effects. The reason is that (apart from gauge- 
dependent terms), the weight f2(r) = exp JJ^ ^ dr' has the form of a Gibbs factor 
exp{{fiN — E) / ksT) . This can be verified by inspection of the equations ()5.50|) . The 
magnitude but also the spread in log-weights Zq will thus grow with energy E and 
particle number A^. Since var [Re {2^0}] must be < (9 (10) for good sampling of the 
distribution, this sets an upper limit on how large a system can be simulated. 

11.3 Physical regimes simulated 
11.3.1 Parameter targeting 

The procedure to calculate properties at given target gas parameters 7 and T was 
as follows: 

1. Three physical quantities specify units. For example, m, h, and ksT = 1/tt 
can be chosen unity. For the purposes of investigating gases at a given 7 and T, 
these can be chosen arbitrarily and properties of the whole family of physical 
gases with those 7 and T follow by scaling. 

2. The target time tt is determined by the choice of units above, and the re- 
maining essential parameters required to simulate ()5.50|) are g and ^{tt)- The 
coupling strength is determined by 



3. The target density is not a direct input parameter into the simulation, but it 
is has a one-to-one correspondence with ^{tt) (while in dimensionless units). 




(11.31) 



while the required target density is 




(11.32) 
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The Yang & Yang exact solutions jHZj provide an algorithm to calculate p(T, fi, g), 
and the chemical potential needed to obtain the desired density ()11.32|1 was 
calculated by numerically inverting this relation. 

11.3.2 Regimes attained 

A wide range of physical regimes were simulated directly with the procedure de- 
scribed in the previous sections. In terms of the two characteristic dimensionless 
gas parameters 7 and T, the physical regions that have been explored are shown in 
Figure 111.11 

In terms of the physical regimes classification of Kheruntsyan et al 199] based on 
^*^^^(0) behavior (These are described in more detail in Section ril.4.Hl . the accessible 
regions are both the quantum degenerate decoherent and classical decoher- 
ent regimes, as well as the nondegenerate fermionized (strong coupling) regime. 
Simulations also access transition regions between these. The interesting tran- 
sition region 7 ^ T ^ (9 (1) where several length scales are of the same order is 
also accessible in part. The lower limit of accessible T does not appear to be a 
hard limit, and more sophisticated importance sampling techniques and/or choices 
of /Ue(r) may be able to make some further inroads into lower temperature regions. 

The basic limiting factor is growth of the variance of the real and/or imaginary 
parts of the log-weight zq. As explained Section lll.2.6| there is a tradeoff between 
the variance of zq and the system size (e.g. lattice length L if the density is set), 
but there is a limit on how small the system can be made if it is to correspond 
to the uniform gas model. L must be significantly larger than the longest relevant 
length scale. However, this limit is only applicable if an unconstrained uniform 
gas is required. The behavior of a gas in a finite box, torus, or trap is an easier 
simulation and can reach temperatures not accessible for uniform gas simulations 
if L can be made small enough to control the growth of var [^o] • The previously 
mentioned calculations of by Carusotto and Castin[65j for the different but related 
closed system model were made in such a "low temperature with finite-size effects" 
regime, with 7^0 (0.001) - O (0.01), f^O (0.001) - O (0.005). 
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Figure 11.1: Physical regimes attained for the uniform interacting ID Bose gas. DARK SOLID 
Unas show the paths in 7, T parameter space taken by simulations for the r range where useful 
precision in g^^-* (x) was obtained. Paths shown include simulations with various forms of /igi either 



given by H11.29|) . or by 2e^^ 



cfJ.{TT)TT _ 



cos[47r(r — tt)/3tt] described in 



The constant form H11.29|l was found to be as good or superior at reaching low temperatures. 
Thick shaded bars indicate approximate transition regions between physical regimes based on 
the classification by Khcruntsyan et al 99 on the basis of ^'^'(0). 



11.4 Observable predictions 
11.4.1 Spatial correlation functions 

A classification of uniform gas behaviors based on the local second order correlation 
function 'g^^^ (0) has been determined from the Yang & Yang solution by Kheruntsyan 
et a/[99j. In this chapter, the finite temperature behavior of 'g^^\x), g^'^^{x), and 
g^^\x, y) has been calculated in several of these regimes and is shown in Figures [11.21 
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x/Xt x/\t 



Figure 11.2: Correlation functions in 7 = 5, T = 1000 decoherent classical gases. Exact 
results are shown as solid lines, with triple Unes indicating uncertainty (mostly not visible at this 
scale). DASHED lines show the ideal Bose gas values for comparison (g*-^-'(a;) is indistinguishable). 
Here, Ip = IOttAt- S = 10^ M = 80, L = SAt- 

to lll.7l Exact results are also compared to ideal gas values. The simulated physical 
regimes were: 

Classical decoherent regime (T ^ max[l,7^]) 

This regime occurs while the thermal wavelength dominates the small scale behavior 
of the particles. In particular, while it is smaller than the interparticle spacing Ip 
and the effective scattering length oid. Correlation functions in a fairly strongly 
interacting part of this regime are shown in Figure 111.21 Correlations decay on the 
length scale of At, with only a small reduction in close range correlation with respect 
to the ideal gas. 

Quantum degenerate decoherent regime (a/t/^tt^ -c T -C 1) 

Here the gas is degenerate, but phase coherence exists only on length scales shorter 
or of the same order as the healing length ^^^'^i. This is not enough for any significant 
quasicondensate to form. Example correlations shown in Figure 111.31 The short- 
range correlations are still largely thermal g^"^^ > 1, as for the classical gas, but a 
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Figure 11.3: Correlation functions in 7 = 0.03, T = 0.236 decoherent quantum degener- 
ate gases. Exact results are shown as solid lines, with triple lines indicating uncertainty (mostly 
not visible at this scale), dashed lines show the ideal Bose gas values for comparison. Here, 
At = 2mip. S = 10^ M = 40, L = 5.14^^. 



systematic reduction of the range of all multi-particle correlations in comparison to 
the ideal gas is seen. 



Fermionized nondegenerate regime (1 ^ T ^ 7^) 

Here the fermion-like nature of strongly repulsive atoms dominates the small-scale 
behavior of the gas. The effective scattering length am is much smaller than both the 
thermal wavelength and the interparticle spacing Ip leading to behavior similar to a 
hard-sphere (Tonks-Girardeau) model. In Figure ITl.4[ one sees strong antibunching 
on length scales oid, several times smaller than the phase coherence length. Three- 
particle correlations are also very strongly reduced at small distances < Oid, with 
I^^^n0,0)| < 0.2 in the example at 7 = 300, f = 6000 shown in Figure dl This 
will lead to a strong reduction in three-particle inelastic scattering losses despite the 
very high temperature. 
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Figure 11.4: Correlation functions in 7 = 300, T = 6000 strongly fermionized nonde- 
generate gases. Exact results are shown as solid lines, with triple lines indicating uncertainty. 
DASHED lines show the ideal Bose gas values for comparison {g^'^''{x) is indistinguishable). Here, 
Ip = 300aiD, At = 3.873aiD, and ^^cai ^ i2.24aiD. S = 10^ M = 36, i 14.7aiD. 

Transition regime with atom pairing (At ~ am ~ O (1)) 

Interesting new physical phenomena were seen in simulations of the transition regime 
where ^ T ^ O (1). Here all the length scales of the system noted in Section fl 1.1. HI 
are of similar order, the exact relationship between them depending fairly sensitively 
on 7 and T. Because of the presence several competing processes, approximate 
methods do not usually give accurate quantitative predictions. 

The transition between the nondegenerate fermionized gas and a decoherent 
quantum degenerate gas is shown in the sequence of Figures Hl. 41111.51 111. 61 and lll.31 
Details of the three-particle correlations in the still-fermionized part of the transition 
are shown in Figure IT 1.71 

There is a parameter regime where a peak in correlations is seen at finite sepa- 
rations, indicating the appearance of pairing between atoms. The pairing arises 
at interparticle distances of 

^pair ~ (11.33) 

when Xj- and ain are of the same order. It appears to be a consequence of competi- 
tion between repulsion, which promotes relative antibunching at short lengths, and 
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Figure 11.5: Correlation functions in 7 = 10, T ~ 20 fermionized nondegenerate gases. 

Exact results are shown as solid lines, with triple lines indicating uncertainty, dashed lines show 
the ideal Bose gas values for comparison. Here, Ip = IOoid, At = = 2.24aiD. S = 10^, 

M = 135, L = lO.TaiD. 

the inherent bunching in a thermal field on length scales < Ay. Enhanced pairing 
at distances /pair is seen to occur both in situations where the point density corre- 
lations 'g^^^ indicate local bunching (> 1 — e.g. Figure ITl.fj|l or antibunching (< 1 
— e.g. Figure ITT3|l . Enhanced three-particle correlations at distances ~ /pair were 
also seen. 

11.4.2 Momentum distributions 

An example of a calculated momentum distribution in the transition region is shown 
in Figure IT 1.81 At low momenta k < knp = Xt/V^jt, the distribution is seen to be 
intermediate between an the ideal gas Bose-Einstein distribution and the Fermi- 
Dirac distribution for the same /i and T. One also sees that the density of states 
function from the exact Yang & Yang solution jHTj differs significantly from the 
actual distribution of momenta. 

At intermediate momenta ^ k ^ '2kT, a depletion in comparison to the ideal 
gas is seen, while very high momentum particles k > 2kj- are more common than 
in the ideal gas. This regime is shown in Figure 111.91 and one sees that the high 
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Figure 11.6: Correlation functions in 7 = 1, T = 1.6 gases. Exact results are shown as SOLID 
lines, with triple lines indicating uncertainty (mostly not visible at this scale), dashed lines show 
the ideal Bose gas values for comparison. Here, Ip = aid. At = 0.793aiD, and ^^'^'^^ — 0.708aiD. 
S = 10^ M = 80, L = 7.07aiD. 




Figure 11.7: General three-particle correlation function in 7 = 10, T = 20 fermionized 
nondegenerate gases. Simulation details as in Figure ITl. 51 [arXiv note: higher resultion available 
at Piotr Deuar's homepage, currently h.ttp://www. physics. uq.edu.au/people/deuar/thesis/] 
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Figure 11.8: Momentum distribution in the 7 = 1, T = 1.6 gases. Solid lines indicate 
the results (with error bars) of a calculation with parameters as in Figure lll.61 Wave vectors k are 
scaled with respect to the thermal momentum kx — Xt/V^tt, while the momentum distribution 
is normalized so that J pn{k)dk — 1. Also shown for comparison are the Bose-Einstein ideal 
gas distribution (dashed), Fcrmi-Dirac distribution (dotted), and density of states pk from the 
Yang & Yang exact solution s? (dash-dotted), all calculated at the same T, /i, and g. These 
are normalized with the same factor as pn , rather than to unity. Subplot (b) shows the ratio with 
the ideal gas distribution p^^ to resolve the high k behavior. 



momentum distribution approaches the T = power-law decay p{k) (x k ^ found 
by Olshanii and DunjkojHZI from the exact Lieb and Liniger solution 

11.4.3 Comparison to exact Yang &; Yang solution 

As a test, these calculations have been compared with the density and energy per 
particle calculated from the Yang & Yang exact solution|HZI. This is shown in 
Figure lTl.lOl At and around the importance sampling target time tt, the agreement 
is excellent. As expected, at r ^ r^- the true distribution is badly sampled due to 
excessive spread in zq. The indicator used to catch this problem is var[Re{2;o}] 
(described in Section HO.lj) . which should he < O (10) to be confident of accuracy. 
For the case shown in Figure lTl.lOl this indicates accurate calculations at r > 0.89tt, 
which is quite conservative. A similar comparison with ^'•^•'(O) values calculated |99j 
from the exact solution also shows agreement. 
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Figure 11.9: Far tails of the momentum distribution in the 7 = 1, T = 1.6 gases. Param- 
eters as in Figure Hi. 81 solid triple lines indicate calculated normalized momentum distribution 
with error bars, dotted the Bose-Einstein ideal gas momentum distribution. The Dot-dashed 
line shows the T — high k asymptotic behavior j97j. 





Figure 11.10: Comparison of numerical calculation (solid lines with error bars) to exact 
solution (asterixes) for the 7 = 1, T = 1.6 calculation with parameters given in Figure IT 1.61 
(a): density p, (b): energy per particle £. Ideal Bose gas (dashed) and Fermi gas values also 
shown for comparison. The DASH-DOTTED vertical line indicates the lower limit on times r for 
which accuracy is expected with a finite sample on the basis of the var[Re{zo}] < 10 indicator 
(See Appendix El- 
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Chapter 12 
Conclusions 

12.1 Overview of main results 

As with the thesis, the main results obtained can be divided into three parts: 

1. Theoretical background work regarding general phase-space distri- 
butions. The first aim here has been to specify what freedoms are available 
to the broadly conceived phase-space distribution approach when it is to be 
used for first-principles simulations of mesoscopic quantum mechanics. The 
considerations of Chapter IHl led to a set of necessary requirements for any 
useful simulation, summarized in Section Some fundamental expressions 
(e.g. observable estimators) have been given for general representations (i.e. 
choices of kernel). The usefulness of a given representation (which is closely 
related to basis choice) is highly system dependent, and so it is hoped that the 
results and considerations of Chapter 01 will aid in matching representations 
to problems. 

With a given representation, often there still remains a wide range of stochas- 
tic equations that simulate the same physical problem, but strongly differ in 
efficiency. The stochastic gauge formalism developed in Chapter HI describes 
the freedoms available in a systematic way. At the coarsest level there are two 
main kinds: Those that arise when making the correspondences from mas- 
ter to Fokker-Planck equations (kernel gauges), and from Fokker-Planck to 
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stochastic equations (diffusion gauges). Some analysis of these freedoms for 
general gauge choices has also been made. Uses have been found to include 

(a) Improvement of efficiency by tailoring the shape of the distribution of 
random variables to improve sampling. 

(b) Removal of biases due to a broad class of so-called boundary term errors, 
which are due to instabilities or divergences in the stochastic equations. 

(c) Allowance for calculations of grand canonical ensemble properties by the 
inclusion of a dynamically varying (with simulation steps) weight. 

Part of the freedoms (e.g. real diffusion gauges) have been determined to 
not be useful. The stochastic gauge formalism includes in a systematic way 
several recent developments [3 [1111313123123 EI] as special cases. A relatively 
restricted set of gauges that still allows the uses listed above has been identified 
in Section 14.51 as the "standard gauges" . 

The causes and symptoms of boundary term errors (sampling biases that do 
not abate in the infinite sample number limit), have been considered in consid- 
erable detail in Chapter 13 These have been major obstacles for many phase- 
space simulations in the past. Two kinds of processes have been identified 
as causing these — either arising when making the correspondence between 
master and Fokker-Planck equations (first kind), or when calculating observ- 
able estimates using the simulated random variables (second kind). Several 
warning symptoms of these errors, which can be identified by inspection of the 
stochastic equations prior to simulation, have been identified. 

It has been found that appropriate choices of drift stochastic gauges (a class of 
Kernel gauge) can be used to remove the biases and instabilities in many (or 
possibly all) cases of boundary term errors of the first kind. This is demon- 
strated in several simple examples where boundary term errors have been 
known to occur, and also used in some subsequent many-mode calculations 
where required. Heuristic guidelines for appropriate gauge choice have been 
found and given in Section [6.3.21 
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2. Investigation of gauge P representation and development of gauges 
for use with interacting Bose gases. This representation is an extension of 
the successful positive P representation based on coherent states. The added 
complex weight factor is shown to allow the use of the standard gauges to 
improve simulations, remove possible boundary term bias, and extend the 
range of physical models that can be simulated (thermodynamic calculations). 

The emphasis here has been almost exclusively on simulations of open Bose 
gases with binary interactions. Equations for dynamics and thermodynam- 
ics calculations have been obtained (including non-delta-function interparticle 
potentials). Part B searched for useful choices of gauges for dynamics and 
thermodynamics calculations, given the constraint that they be local (i.e. de- 
pendent only on variables at a single lattice point). This constraint is not 
optimal, but is a basic starting point for possible further investigation. Both 
diffusion and drift gauges that improve simulations under appropriate condi- 
tions were found. Diffusion gauges were able to significantly improve sampling 
and useful simulation times in dynamics calculations, while drift gauges are 
essential for any accurate thermodynamic calculation, and were found to also 
give sampling improvement in dynamics under some conditions (See Chap- 
ter |H1). 

Analyses were made of the expected behavior of multi-mode simulations, and 
subsequently confirmed in the actual calculations. In dynamics calculations, 
simulation time is found to be limited by properties of the most highly occu- 
pied lattice point, and a robust estimate of this time for positive P simulations 
is given by ()1().4|1 . Relative to this positive P simulation time, it was found 
that simulation time improvement with diffusion gauges occurs when lattice 
spacing is greater than the healing length in the gas. Local drift gauges were 
found to give even more improvement in simulation time but for a much nar- 
rower range of systems in which two-body collision effects dominate kinetic 
processes, as in some simulations in Chapter |H| Thermodynamic simulations 
were made possible with judicious use of a drift gauge. It was also found that 
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simulation precision at a target temperature and chemical potential was depen- 
dent (sometimes strongly) on the (otherwise free) choice of chemical potential 
at higher temperatures. 

3. Simulations of many-mode interacting Bose gases Several nontrivial 
mesoscopic systems of open interacting Bose gases have been simulated. These 
were dynamics of: 

• Uniform ID and 2D gas with effectively local scattering. Wave behavior 
occurring in the two- or three-particle correlations (but not density) was 
found, and displays some interesting properties (e.g. movement of main 
wavefront at the speed of sound). 

• Uniform ID gas with extended nonlocal interparticle interactions. Corre- 
lation wave behavior was also seen, but with different properties than for 
the locally-interacting gas. Simulation time was found to increase signifi- 
cantly with respect to a locally-interacting gas with the same interaction 
energy density. 

• Trapped ID gas with extended interactions on the length scale of the 
trap. Strong interplay between the scattering and breathing of the atom 
cloud in the trap was seen. 

• Scattering of atoms from colliding BECs in 3D. Bosonic enhancement of 
initially empty modes was seen in a physical situation similar to the Vogels 
et al four wave mixing experiment |41j (the difference was that there were 
less atoms in the simulation). Additionally, a significant suppression of 
the spontaneous scattering process compared to the imaginary scattering 
length estimate is predicted. 

Also: 

• Calculations of grand canonical ensembles were made for the uniform ID 
gas at temperature T and chemical potential /i. Physical regimes reached 
included the nondegenerate strongly interacting fermionized regime, and 
the quantum degenerate decoherent regime. Spatial correlation functions 
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of first 'g^^^ (x), second ^(^^(^ and third order 'g^^^ (x, y) have been calculated 
as well as momentum distributions. A transition regime where enhanced 
atom pairing occurs at a preferred interparticle distance has been found. 
This occurs when the de Broglie thermal wavelength is of the same 
order as the ID scattering length oid (effectively the "size" of the atom 
for scattering processes). The pairing was found to occur at distances 
~ At/2 for the parameters simulated. 

These simulations can be tractable even with very large numbers of modes or par- 
ticles in the system given the right conditions, as evidenced by Section ri().7[ where 
there were 1009 152 lattice points and 150 000 atoms on average. 

Situations where there are several length scales of similar order, or processes of 
similar strength are of particular interest for first-principles simulations because it 
is difficult to make accurate quantitative predictions otherwise. The calculations in 
Sections 110. ()l and 110.71 and the thermodynamics calculations were of this type, and 
were seen to be amenable to simulation using the gauge P method. In fact, even in 
a perturbative regime (e.g. the correlation phenomena at low intensity can probably 
be treated by Bogoliubov approximation approach), the equations are convenient to 
use: One simply repeatedly simulates a GP equation with noise and takes appropri- 
ate averages. All quantum effects are included to within statistical precision. The 
caveat is that simulation time is limited. Nevertheless, many phenomena can still 
be seen, and simulation time can be extended with appropriate gauges under some 
conditions. 

12.2 Future directions of improvement 

The local diffusion gauges developed here give significant improvements only when 
the lattice is relatively sparse, and/or scattering interactions are dominant. A way 
to overcome these limitations might be found by using nonlocal diffusion gauges 
that would depend on neighboring, or preferably all, lattice amplitudes a^, /3n- 
A possible starting point would be to consider optimizing diffusion gauges in the 
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opposite regime of strong kinetic interactions and weak scattering. The equations for 
such a model are probably best considered in Fourier space where kinetic processes 
take on a simpler form. 

Another major issue is the reduction of simulation time with system size due to 
increasing weight or zq spread when drift gauges are used. (As was discussed in Sec- 
tion A starting point here may be to determine why the lattice-dependent 
diffusion gauge ()10.26|) did not give the expected simulation time improvements 
()10.28|1 . If the expected M~^^^ scaling could be achieved, many useful mesoscopic 
simulations should become accessible with a drift gauge. Such a drift-gauged sim- 
ulation would be free of moving singularities, and possibly also extend simulation 
time as was seen for the drift gauged one- and two-mode systems. 

A completely different but potentially very promising approach is to use non- 
coherent state kernels for the representation. Certainly in many low temperature 
regimes this would be a more efficient approach if viable representations could be 
found because the low temperature ground and low excited states are often far from 
coherent states. Some preliminary attempts with squeezed state kernels give mixed 
results UnU. 

For thermodynamics calculations of properties at and around a target tempera- 
ture and chemical potential, an important outstanding issue is to characterize the 
influence of the choice of /i(T) at intermediate temperature values. It is conceiv- 
able that a good choice of /i(T) may give dramatic improvement in precision and 
significantly extend the physical region that can be simulated. Some precision and 
simulation time improvement should also be obtained by using a more sophisticated 
importance sampling technique than the rough method used in Chapter El Some 
preliminary results indicate that using a Metropolis sampling algorithm |1 05 j extends 
the reachable 7, T regime PHEj- 
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Appendix A 

Exponentials of Gaussian random 
variables 



Gauge P simulations of interacting Bose gases (whether dynamic or thermodynamic) 
involve multiplicative noise terms of the form 



where z is a complex variable (an, Pn, zq), and c a constant. This leads to \ogz 
being distributed approximately as a Gaussian. This is exact at short times, but 
some modification of the distribution occur also as a consequence of the other ". . . " 
drift (or noise) terms in the equations. Observable estimates are usually obtained 
through averages of non-logarithmic combinations of z, however, and this requires 
some care. Let us consider the idealized situation of ^ ~ f ^ = Vqc'^^ = e"^ , where 
^ is a Gaussian random variable of mean zero, variance unity. The notation of 
Section EH will be used. 

Using the distribution of ^, ()7.3(ij) . one obtains 



sampled correctly if ^ is chosen according to its Gaussian distribution. With S 
samples, the greatest value of obtained can be expected to be ^max, the solution 
of i[l - erf(^rnax/y2)] ~ 2/5, where erf(a;) = 2 e-^^dt is the error function. For 



dz = czdW{t) + 



(A.l) 




(A.2) 



This distribution falls off slowly as 



oo, and for a finite sample may not be 
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Figure A.l: Finite sample estimates of means of exponentials of gaussian random variables 
Va = e°^^. SOLID lines: S = 10^ sample estimate of {va)^^^^]^ and error bars, where ^ was sampled 
from a Gaussian distribution, dotted lines: Same but using S = 1000 samples, dashed lines: 
Exact {S — > oo) value of the mean (va) — » Wg., along with error estimates for the S = 10^ calculation 
based on the exact value of var [va], from H7.39|l and H7.40|l . This corresponds to what would be 
expected from an unbiased sample of Va-. All error bars were calculated according to 17.41|l . Subplot 
(b) shows the same values, but scaled with respect to the 5 — > oo mean Va-. Values at all a were 
made with the same sample of ^ to show the direct dependence on a only. 



S ^ O (1000) to O (10^), this gives ^^ax ~ 3 to 4. Some bias in ('y<T)stoch (underes- 
timation) may be expected due to the lack of samphng of the far tails once 

roo 

/ Ft{v^) dv^:>l/S. (A.3) 

J VQ exp((Tgniax) 

For typical sample sizes S ^ O (1000) to O (10^) this will usually occur around 
o"^ > 10. This is actually the same large a region given by ()7.43|1 where all precision 
is lost anyway because of excessive distribution spread (as shown in Section IT!^ . 

However, the problem is that both the mean (fo-)stoch CLT precision 

estimate ■>/ var [va] / S are underestimated when a is large, but the error estimate 
is underestimated by an even greater amount than the mean. As a result, badly 
sampled data may still appear to be significant for large a values. The situation is 
shown in Figure \K7l\ 

The simplest practical solution is to simply discard any calculated means {z)^^^^^^ 
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when 

var[log|2|] > 10. (A.4) 

If an unbiased mean {z)^^^^^ and precision estimate Az was obtained by some more 
sophisticated method, then it would not be significant anyway since one expects 
Az ^ (-2) stoch these large a values. The main point here is that quantities ()A.4|) 
should be monitored when dealing with multiplicative noise of the form (jA.ljl in the 
stochastic equations. 

Some subtleties can arise because in realistic simulations logz is not exactly 
Gaussian. If the large logz distribution tails fall off more rapidly than Gaussian, 
then an unbiased simulation can be obtained for larger var [log 1^1], while the converse 
is true if these tails fall of less rapidly. A more robust indicator of possible bias 
than ()A.4|) is to compare the observable estimate obtained with two sample sizes 
S, differing by at least an order of magnitude. If sampling bias is present, the 
two estimates of the average (2;) stoch ^^^^ usually differ by a statistically significant 
amount. An example of this for the idealized model is shown in Figure \Ka\ This 
kind of sample-size-dependent behavior is always a strong warning sign. 
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Appendix B 

Some details of Stochastic 
Calculus 

Some results regarding random terms in differential equations relevant to the thesis 
are gathered here. Proofs of these can be found in Gardiner|53j. with details on 
computer algorithms in j82j . 

In many cases, a set of stochastic differential equations (i.e. differential equations 
with random terms) can be written in the Langevin form 

dxj{t) = Aj{x, t)dt + Y^ 5jfc(x, t) dWkit), (B.l) 

k 

where x contains all variables Xj. The dWj{t) are Wiener increments and ()B.1|) is 
to be interpreted according to the Ito calculus (see below). 

A Wiener increment is defined in terms of the Wiener process W{t), which is the 
special case of ()B.1|) with j = 1, Ai = and Bi^ = Ski- The probability distribution 
of W is governed by the Fokker-Planck equation (FPE) 

dPw{W,t) _ d^Pw{W,t) 

dt ~ dW ' ^ ^ 

a special case of Brownian motion. The individual realizations of W are continu- 
ous but not different iable. The (infinitesimal) Wiener increment can, however, be 
defined, and is 



dW{t) = W{t + dt) - W{t), 
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with dt infinitesimal. This random quantity has the expectation values 

(^W^(i))stoch = (B.4a) 
{dW{tr\,^,^ = dt (B.4b) 
(rfW(t)rfl^(t')),toch = iit^t' (B.4c) 

^max[j]>2 \ 

Y[ dW{tj)\ = (B.4d) 

J=l / stoch 

The Wiener increment is related to processes with white noise correlations such 
that if (^(t)^(t'))stoch = ^(^ - then one can write ^{t) = dW{t)/dt. In flRT|l . 
the Wiener increments are independent: {dWjdWk^j)^^^^-^ = (c?W^j)stoch ('^^fc)stoch 
etc. In a numerical calculation, the Wiener increment is usually implemented as 
independent Gaussian random variables AWj at each time step of length At with 
mean zero and variance 

{AW.AW,)^,^^^ = AM,,, (B.5) 

although other choices of the distribution of AWj are possible provided only that 
the discrete step analogues of ()B.4|) are satisfied, as in ()B.5|) . 

An equation (jB.lj) in the Ito calculus is equivalent to the Stratonovich calculus 
equation 

dxj (t) = Aj (x, t)dt + ^ Bjk (x, t) dWk (t) + Sj (x, t) 

k 

= dxf°it) + Sji^,t), (B.6) 
where the Stratonovich correction is 

^.(x,t) = -|$:5...(x,t)^5|^. (B.7) 

kl ' 

These two forms arise from different ways of defining the integral of the differential 
equations, both useful. For practical purposes, the main differences are that: 

• In the Ito calculus, the time-dependent variables Xj{t) are independent of 
the same-time Wiener increments Wk{t), while this is not the case in the 
Stratonovich calculus. 
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• In the Stratonovich calculus, the Xj obey the usual rules of deterministic cal- 
culus, in particular the chain rule. In the Ito calculus one instead has 

dy{^) = j:dx,{^,t)^ + ^J:BA^^')Bi.M^- (B.8) 

j jkl 

Because of the first point above, the Ito calculus corresponds to a discrete step 
integration algorithm with the derivative approximated using values at the beginning 
of the time interval: 

Xj{t + /\t) =Xj(t) +Aj(x(t),t)At + J^Sjfe(x(t),t)AVrfc(t). (B.9) 

k 

Implicit integration algorithms estimate the derivative using quantities evaluated at 
later times^ 

t^ = t + KAt (B.lOa) 
(where k G (0, 1]) during the timestep. That is, 

xj{t + At) = x,{t) + A,(x(t,),t,) At + ^5,fe(x(t,),t,) AWk{t). (B.lOb) 

k 

In this case, one must use a Stratonovich (or Stratonovich-like) form of the drift: 

dxj{t) = dxf°{t) + 2KSj{x,t) (B.ll) 

to ensure that ()B.10b|) is the same as ()B.9|) up to lowest deterministic order At. This 
follows by use of ()B.5|1 on (jB.10b|l . For multiplicative noise it has been found that 
using a semi-implicit method {k = |) integration method gives superior numerical 
stability IH21- 



common algorithm for implicitly estimating the x{t^) is to iterate (|B.10b|l several times 
using the appropriate smaller timestep At — > KAt. 
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Appendix C 

Quotients of means or random 
variables 



In the gauge P representation, estimates of the expectation value of an observable 
O are made by evaluating the expression ()3.14|) for a finite ensemble of S samples. 
This has the form of a quotient 

stoch stoch /T^ 1 ^ 

Error estimates are made by subensemble averaging as explained in Section 13.3.21 
While for dynamics calculations, one is assured of (Re = 1 in the limit 

iS — s> oo at all times, so that the denominator can be ignored, thermodynamics 
calculations require the evaluation of both denominator (/d) stoch numerator in 
(jdlll . In particular, the error estimate ()3.18j) using subensembles requires some 
care. 

The problem is that if there are not enough samples s in (say) the jth subensem- 
ble, then the jth denominator average may have the wrong sign or be close to zero. 
The result is that this jth subensemble average O^"''' takes on very large or non- 
realistic values, which are nowhere near (O). Since their absolute values can be 
very large if the denominator was close to zero, these outlier O strongly influence 
the final error estimate. An example of such excessive error estimates is shown in 
Figure lUTl and typically goes in hand with spiking in the uncertainty AO. 

Roughly, one expects that if there are Se ^ 1000 subensembles, then the far- 
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Figure C.l; Error estimates with difTerent subensemble sizes s. SOLID lines show the 
means and error esthnates obtained using (|3.18() for the quantity = 1/ + 0.2), where 

v{t) obeys the Brownian motion stochastic equation dv = dW{t) {dW{t) is a Wiener increment). 
The standard deviation of w is ct = \/t. In each simulation there were iS = 10^ trajectories in all. 
Expression (|C.2|I suggests s > 2250 is required at t = 10. 



thermost outlier will be at about 3a from the mean. Thus, excessive uncertainty is 

> 



likely to occur once 3 W var / 



D 



(/z))stoch' where fj^ is the denominator average 
for the jth subensemble. For subensembles with s elements, this gives the rough 
limit 



s>9 



var [/ 



D 



(C.2) 



(/£')stoch 

Often the variation in the weight Q = is mainly due to a close-to-gaussian 
distributed Re {2:0}. In this case, using fj7.39|) and ()7.40p . the requirement ()C.2|) 
becomes 



s > 9 (e™'-[R^^^o>l - 1) . 



(C.3) 



